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Prior to examination of the above-identified new national phase patent application, 
please amend the application, as follows: 

In the Claims 

Please amend claims 1-38 to read as follows: 

1 . A DNA sequence coding for a protein which is involved in the development of 
the nervous system, in particular the CNS, and is expressed in a tissue-specific and 
development-specific manner, wherein the DNA sequence comprises the following 
DNA sequences: 

(a) the DNA sequence of figure 1, figure 2, figure 3, figure 4, figure 5, figure 
6, figure 7 or figure 8; 

(b) the DNA sequence of figure 9 or figure 10; 

(c) the DNA sequence of figure 1 1 ; 

(d) the DNA sequence of figure 12 or figure 13; 

(e) the DNA sequence of figure 14 or figure 15; 

(f) the DNA sequence of figure 16; 

(g) the DNA sequence of figure 17 or 18; 

(h) the DNA sequence of figure 1 9; 

(i) a DNA sequence hybridizing with (a), (b), (c), (d), (e), (f)> (g) or (h) 
(j)fragments, variants, functional equivalents, derivatives or precursors of the 
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DNA sequence of (a), (b), (c), (d), (e), (f), (g), (h) or (i); or 

(k) a DNA sequence which differs from the DNA sequence of (a), (b), (c), 
(d), (e), (f), (g), (h), (i) or (j) due to the degeneration of the genetic 
code. 

2. The DNA sequence according to claim 1, which codes for a protein or peptide 
comprising the amino acid sequence of figure 1, figure 9, figure 11, figure 12, 
figure 13, figure 14, figure 15, figure 16, figure 17, figure 18 or figure 19, 
wherein the protein or peptide has the biological activity defined in claim 1 . 

3. An antisense RNA, characterized in that it is complementary to the DNA 
sequence of claim 1 and can reduce or inhibit the synthesis of the protein encoded 
by this DNA sequence. 

4. A ribozyme, characterized in that it is complementary to the DNA sequence of 
claim 1 and can bind specifically to the RNA transcribed by this DNA sequence 
and can cleave it so as to reduce or inhibit the synthesis of the protein encoded 
by this DNA sequence. 

5. An expression vector, containing the DNA sequence selected from the group 
consisting of the protein according to claim 1 the antisense RNA according to 
claim 3 or the ribozyme according to claim 4. 

6. The expression vector according to claim 5, which comprises additionally the 

promoter of the human T gene or an ortholog of the T gene. 

7. An expression vector according to claim 6, which codes for a protein selected 

from the group consisting of T, T2, T3 proteins or for fragments thereof in the 
form of a reporter fusion protein. 

8. A host cell which is transformed with an expression vector selected from the 
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group consisting of the expression vector of claim 5, claim 6 and claim 7. 

9. A protein which is encoded by the DNA sequence according to claim 1 and 

which is involved in the development of the nervous system and is expressed in 
tissue-specific and development-specific manner, or fusion proteins, fragments, 
variants, derivatives or precursors of the protein 

10. Protein according to claim 9, which has one of the following motives: 
Motive 1: 

(A,T) (I,P,V) (L,T) (G,A,Q) (L,V)XXX(L,V) 
Motive 2: 
IYTDQWAN 
Motive 3: 

AXXXXXXXXXGXXXXXXAXXXXXXXXXXXXXXXXXXQ 
Motive 4: 

SXXXXDX (12,20) KX (17, 22)AXXXXXXXXL 
Motive 5: 

I YTDW AN X X LX (K, R) 
Motive 6: 

KX(18,21)AXXXXXXXXLX(15,24) S 
Motive 7: 

NX (3,1 1) SXXXAXXXXXXXL 
wherein X every amino acid 

(A,T) = amino acid A or T at this site 
X(number 1, number 2) = number 1 to number 2 
Xs at this site. 

A method of producing the protein according to claim 9, which comprises 
culturing the host cell according to claim 8 under suitable conditions and 
obtaining the protein from the cell or the culture medium. 

Antibody which is directed against the protein according to claim 9 or fragment 



11. 



12. 
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thereof. 

13. Antibody according to claim 12, which is obtained by immunizing animals with a 

peptide having the sequence 
"EKGEDPETRRMRTVKNIADI" 

14. A method for preventing or treating diseases of the nervous system by using a 
member selected from the group consisting of the DNA sequence according to 
claim 1, the antisense RNA according to claim 3, the ribozyme according to 
claim 4, the expression vector according to any of claims 5 to 7, the protein 
according to claim 9 and the antibody or the fragment thereof according to claim 
12 or 13 for preventing or treating diseases of the nervous system, in particular 
of the CNS. 

15. The method according to claim 14, wherein the disease of the nervous system is 
a tumoral disease of the CNS. 

16. The method according to claim 14, wherein the treatment of diseases of the 
nervous system are the promotion of the neuronal regeneration in the case of 
injuries of the nervous system and degenerative diseases of the nervous system. 

17. The method according to claim 14, wherein the treatment of diseases of the 
nervous system are the regeneration of the neuronal linkages and the 
regeneration of the innate and acquired malfunctions of the nervous system. 

18. The method according to claim 15 for inhibiting the growth and spreading of 
tumor cells. 

19. Diagnostic method for detecting a disturbed expression of the protein according 
to claim 9 or for detecting a changed form of this protein, in which a sample is 
contacted with a member selected from the group consisting of the DNA 
sequence according to claim 1. the DNA sequence according to claim 2, the 
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antibody or the fragment thereof according to claim 12, and the antibody or the 
fragment thereof of claim 13 and then it is determined directly or indirectly 
whether the concentration of the protein and7or its amino acid sequence differs 
from a protein obtained from a healthy patient. 

20. Diagnostic kit for carrying out the method according to claim 1 9, which contains 

at least one member selected from the group consisting of the DNA sequence 
according to claim 1, the DNA sequence according to claim 2, the antibody or 
the fragment thereof according to claim 12, and the antibody or the fragment 
thereof according to claim 13. 

21. Non-human mammal whose naturally occurring T, T2 or T3 gene comprises a 

change in the gene structure or the gene sequence. 

22. Non-human mammal, wherein a change of the gene structure of the T, T2 or G3 

gene is achieved in the mammal by introducing a deletion in place of which a 
homologous or heterologous sequence is introduced. 

23. Non-human mammal, wherein a change of the gene structure of the T, T2 or C3 
gene is achieved by inserting a homologous or heterologous sequence in the 
corresponding gene naturally occurring in the mammal. 

24. Non-human mammal according to claim 22, wherein the heterologous sequence 

is the selection marker sequence. 

25. Non-human mammal according to claim 24, wherein the selection marker 

sequence conveys resistance to neomycin. 

26. A method of producing a non-human mammal selected from the group consisting 

of the non-human mammal according to claim 21, claim 22, claim 23, claim 24 
and claim 25, characterized by the steps of: 

(a) producing a DNA fragment, in particular a vector, containing a changed 
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T, T2 or G3 gene, the T, T2 or T3 gene having been modified by 
inserting a heterologous sequence, in particular a selectable marker; 

(b) preparing embryonal stem cells from a non-human mammal (preferably a 
mouse); 

(c) transforming the embryonal stem cells from step (b) with the DNA 
fragment from step (a) ? the T gene in the embryonal stem cells being 
changed by homologous recombination with the DNA fragment from (a) 

(d) culturing the cells from step (c), 

(e) selecting the cultured cells from step (d) for the presence of the 
heterologous sequence, in particular the selectable marker, 

(f) producing chimeric non-human mammals from the cells of step (e) by 
injecting these cells into mammalian blastocysts (preferably mouse 
blastocysts), transferring the blastocysts to pseudo-pregnant female 
mammals (preferably mouse) and analyzing the resulting offspring for a 
change of the T, T2 or T3 gene. 

27. Transgenic cell or tissue which is capable of expressing a T protein or part of the 
T protein or an ortholog thereof. 

28. A method for the analysis of the function of the T gene family by using a 
member selected from the group consisting of the non-human mammal 
according to claim 21, claim 22, claim 23, claim 24, claim 25 the transgenic cell 
of claim 27 or the transgenic tissue according to claim 27. 

29. A method for identfifying inhibitors and enhancers of the T gene family by using 
the non-human mammal according to claim 21, claim 22, claim 23, claim 24, 
claim 25, the transgenic cell according to claim 27 or the transgenic tissue 
according to claim 27. 

30. Vertebrate gene and functional equivalent, derivative or a bioprecursor thereof, 
which code for a protein having a statistically significant amino acid sequence 
homology to the T gene, T2 gene or T3 gene according to any of the following 
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figures selected from the group consisting of: figure 1, figure 9, figure 11, figure 
12, figure 13, figure 14, figure 15, figure 16, figure 17, figure 18 or figure 19. 

31. T gene and its vertebrate orthologs and vertebrate paralogs which code for a 

nuclear pore protein. 

32. Vertebrate protein which has an amino acid sequence according to figure 1 or an 

amino acid sequence which differs from the amino acid sequence in figure 1 by 
one or more amino acids. 

33. Vertebrate T, T2 or T3 gene and the protein encoded therein in all of its naturally 
occurring allelic and mutated forms. 

34. Medicament containing a protein according to claim 9 or a functional equivalent, 
a fragment or a bioprecursor thereof in combination with a pharmaceutically 
acceptable carrier. 

35. The method of identifying substances which has an enhancing or inhibiting 
influence on the effect of T protein, T2 protein or T3 protein, by means of 
determining the bi-directional transport through the nuclear pores, 

determining the bindinq to filaments of the cell (e.g. actin filaments and 
microtubuili) or 

determining the increased or reduced transcription of cellular or reporter genes. 

Method of identifying substances which have an enhancing or inhibiting 
influence on the effect of proteins which are functionally linked to the T protein 
in direct or indirect way, or represent parallel signal or functional pathways, by 
means of 

determining the bi-directional transport through the nuclear pores, 



36. 
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determining the phosphorylation and the dephosphorylation of proteins, 

determining the binding of the T protein to filaments of the cell (e.g. 
actin filaments and microtubuli), or 

determining the increased or reduced transcription of cellular or reporter 
genes. 

37. The method according to claim 35, wherein the modified transcription with 
reporter molecules, preferably the occurrence of certain mRNAs or the EGEP 
protein, is detected. 

38. The method of identifying further proteins which play a role in the development 
and function of the nervous system and/or are a nuclear pore protein, wherein 
the method comprises the steps of: 

(a) producing an antibody against a protein according to claim 9, 

(b) contacting a cell extract with the antibody and identifying the 
antibody/protein complex, 

(c) analyzing the complex to identify a protein which has bound to the 
protein of the complex and is no antibody, and 

(d) optionally repeating steps (a) to (c) to identify further proteins of this 
function. 
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REMARKS 



A marked-up version of amended paragraph in the specification and amended claims 1- 
38 are included herewith in Appendix A. 

It is requested that the examination and prosecution of this application proceed on the 
basis of the English translation of the PCT International application included herewith 
and these amended claims 1-38. 



INTELLECTUAL PROPERTY/ 
TECHNOLOGY LAW 
P. O. Box 14329 

Research Triangle Park, NC 27709 
Phone: (919)419-9350 
Fax: (919)419-9354 
Attorney File: 4121-129 



Respectfully submitted, 




Marianne Fuierer 



Registration No. 39,983 
Attorney for Applicants 
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APPENDIX A 

1. A DNA sequence coding for a protein which is involved in the development of the 
nervous system, in particular the CNS, and is expressed in a tissue-specific and 
development-specific manner, wherein the DNA sequence comprises the following DNA 

sequences: 1 

(a) the DNA sequence of figure 1, figure 2, figure 3, figure 4, figure 5, figure 
6, figure 7 or figure 8; 

(b) the DNA sequence of figure 9 or figure 10; 

(c) the DNA sequence of figure 1 1 ; 

(d) the DNA sequence of figure 12 or figure 13; 

(e) the DNA sequence of figure 14 or figure 15; 

(f) the DNA sequence of figure 16; 

(g) the DNA sequence of figure 17 or 18; 

(h) the DNA sequence of figure 19; 

(i) a DNA sequence hybridizing with (a), (b), (c), (d), (e), (f), (g) or (h) 

(j) fragments, variants, functional equivalents, derivatives or precursors of the 
DNA sequence of (a), (b), (c), (d), (e), (f), (g), (h) or (i); or 

(k) a DNA sequence which differs from the DNA sequence of (a), (b), (c), 
(d), (e), (f), (g),(h),(i) or (j) due to the degeneration of the genetic code. 
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3. An antisense RNA, characterized in that it is complementary to the DNA sequence 
of claim 1 [or 2] and can reduce or inhibit the synthesis of the protein encoded by 
this DNA sequence. 

4. A ribozyme [Ribozyme], characterized in that it is complementary to the DNA 
sequence of claim 1 [or 2] and can bind specifically to the RNA transcribed by 
this DNA sequence and can cleave it so as to reduce or inhibit the synthesis of the 
protein encoded by this DNA sequence. 

5. An expression [Expression] vector, containing the DNA sequence selected from 
the group consisting of the protein according to claim 1 [or 2 or coding for] the 
antisense RNA according to claim 3 or the ribozyme according to claim 4. 

7. An expression [Expression] vector according to claim [5 or] 6, which codes fora 

protein selected from the group consisting of T for the] T, T2^ [or] T3 proteins or 
for fragments thereof in the form of a reporter fusion protein. 

8. A host [Host] cell which is transformed with [the] an expression vector selected 

from the group consisting of the expression vector of claim 5, claim 6 and claim 
7. [according to any of claims 5 to 7.] 

9. A protein [Protein] which is encoded by the DNA sequence according to claim 1 

[or 2] and which is involved in the development of the nervous system and is 
expressed in tissue-specific and development-specific manner, or fusion proteins, 
fragments, variants, derivatives or precursors of the protein 

11. A method [Method] of producing the protein according to claim 9, which 
comprises culturing the host cell according to claim 8 under suitable conditions and 
obtaining the protein from the cell or the culture medium. 
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14. A method for preventing or treating diseases of the nervous system by using a 
member selected from the group consisting of PUse of ]the DNA sequence 
according to claim 1 [or 2], the antisense RNA according to claim 3, the ribozyme 
according to claim 4, the expression vector according to any of claims 5 to 7, the 
protein according to claim 9 and fori the antibody or the fragment thereof 
according to claim 12 or 13 for preventing or treating diseases of the nervous 
system, in particular of the CNS. 

15. The method [Use] according to claim 14, wherein the disease of the nervous 
system is a tumoral disease of the CNS. 

16. The method [Usel according to claim 14, wherein the treatment of diseases of the 
nervous system are the promotion of the neuronal regeneration in the case of 
injuries of the nervous system and degenerative diseases of the nervous system. 

17. The method [Use] according to claim 14, wherein the treatment of diseases of the 
nervous system are the regeneration of the neuronal linkages and the regeneration 
of the innate and acquired malfunctions of the nervous system. 

18. The method [Use] according to claim 15 for inhibiting the growth and spreading 
of tumor cells. 

19. Diagnostic method for detecting a disturbed expression of the protein according to 
claim 9 or for detecting a changed form of this protein, in which a sample is 
contacted with a member selected from the group consisting of the DNA sequence 
according to claim 1. the DNA sequence according to claim 2, [or 2 or] the 
antibody or the fragment thereof according to claim 12, and the antibody or the 
fragment thereof of claim [or] 13 and then it is determined directly or indirectly 
whether the concentration of the protein and/or its amino acid sequence differs 
from a protein obtained from a healthy patient. 
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20. Diagnostic kit for carrying out the method according to claim 19, which contains at 
least one member selected from the group consisting of the DNA sequence 
according to claim 1, the DNA sequence according to claim [or] 2, [and/or] the 
antibody or the fragment thereof according to claim 12, and the antibody or the 
fragment thereof according to claim [or] 13. 

24. Non-human mammal according to claim 22 [or 23], wherein the heterologous 
sequence is the selection marker sequence. 

26. A method of producing a non-human mammal selected from the group consisting 
of the non-human mammal according to claim 21, claim 22, claim 23, claim 24 
and claim 25,[ to any of claims 21 to 25], characterized by the steps of: 

(a) producing a DNA fragment, in particular a vector, containing a changed T, 
T2 or G3 gene, the T, T2 or T3 gene having been modified by inserting a 
heterologous sequence, in particular a selectable marker; 

(b) preparing embryonal stem cells from a non-human mammal (preferably a 
mouse); 

(c) transforming the embryonal stem cells from step (b) with the DNA 
fragment from step (a)^ the T gene in the embryonal stem cells being 

changed by homologous recombination with the DNA fragment from (a) 

(d) culturing the cells from step (c), 

(e) selecting the cultured cells from step (d) for the presence of the 
heterologous sequence, in particular the selectable marker, 

(f) producing chimeric non-human mammals from the cells of step (e) by 
injecting these cells into mammalian blastocysts (preferably mouse 
blastocysts), transferring the blastocysts to pseudo-pregnant female 
mammals (preferably mouse) and analyzing the resulting offspring for a 
change of the T, T2 or T3 gene. 

28. A method for the analysis of the function of the T gene family by using a member 
selected from the group consisting of the [Use of the] non-human mammal 
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according to [any of claims] claim 21 , claim 22, claim 23, claim 24, claim [to] 25 [ 
or] the transgenic cell of claim 27 or the transgenic tissue according to claim 27. 
[for the analysis of the function of the T gene family.] 

29. A method for identifying inhibitors and enhancers of the T gene family by using 
[Use of] the non-human mammal according to claim 21. claim 22, claim 23, cla im 
24, claim 25, [to any of claims 21 to 25 or] the transgenic cell according to claim 
27 or the transgenic tissue according to claim 27. [for identifying inhibitors and 
enhancers of the T gene family.] 

30. Vertebrate gene and functional equivalent, derivative or a bioprecursor thereof, 
which code for a protein having a statistically significant amino acid sequence 
homology to the T gene, T2 gene or T3 gene according to any of the following 
figures selected from the group consisting of : figure 1, figure 9, figure 11, figure 
12, figure 13, figure 14, figure 15, figure 16, figure 17, figure 18 or figure 19. 

37. The method according to claim 35 [or 36], wherein the modified transcription 
with reporter molecules, preferably the occurrence of certain mRNAs or the 
EGEP protein, is detected. 
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The present invention relates to a protein (T protein) and 
to proteins related thereto which are involved in the 
development of the nervous system and are expressed in a 
tissue-specific and development-specific manner, to the 
below described variants of these proteins and to DNA 
sequences coding for these proteins. The present invention 
further relates to antibodies directed against these 
proteins or to fragments thereof as well as to antisense 
RNAs or ribozymes directed against the expression of these 
proteins. Finally, the present invention concerns 
medicaments and diagnostic methods in which the above- 
mentioned compounds are used. 

Mutations in genes playing a part in the development and 
maintenance of the nervous system are of utmost scientific 
and economic significance, since diseases of the nervous 
system, in particular CNS, occur frequently, are often 
characterized by a severe, partly fatal disease process and 
can be treated only to a limited extent thus far. The 
increase in the life expectancy is accompanied by a drastic 
increase in neurological and psychic diseases. The latter 
greatly limit the quality of life of the affected persons 
and cause considerable costs for both the affected person 
and the public. 

Isolating and analyzing genes specific to the nervous system 
offer a good possibility of studying diseases, such as 



Nervous Sy s t em 



schizophrenia, Alzheimer 1 s disease, 
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and mental retardation, and eventually of also being able to 
treat them. 

The present invention is thus based on the technical problem 
of providing products by means of which disturbances in the 
development and function of the nervous system can be 
diagnosed and optionally be treated. 

The solution to this technical problem is achieved by 
providing the embodiments characterized in the claims. 



The subject matter of the present invention is thus a DNA 
sequence coding for a protein which is involved in the 
development and function of the nervous system, in 
particular the CNS, and is expressed in tissue-specific and 
development-specific manner, the DNA sequence comprising the 
following DNA sequences: 

(a) the DNA sequence of figure 1, figure 2, figure 3, 
figure 4, figure 5, figure 6, figure. 7 or figure 
8; 

(b) the DNA sequence of figure 9 or figure 10; 

(c) the DNA sequence of figure 11; 

(d) the DNA sequence of figure 12 or figure 13; 

(e) the DNA sequence of figure 14 or figure 15; 

(f) the DNA sequence of figure 16; 

(g) the DNA sequence of figure 17 or 18; 

(h) the DNA sequence of figure 19; 

(i) a DNA sequence hybridizing with (a), (b) , (c) , 
(d), (e), (f), (g) or (h); 

(j) variants, derivatives, precursors or fragments of 
the DNA sequence of (a), (b) , (c) , (d), (e) , (f), 
(g) , (h) or (i) ; or 
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(k) a DNA sequence differing from the DNA sequence of 
(a), (b), (c), (d) , (e), (f), (g), (h) , (i) or (j) 
due to the degeneration of the genetic code. 

The present invention is based on the isolation of a human 
DNA sequence (referred to as gene "T" or T gene; see figures 
1 to 8, which codes for protein TP), it turning out that the 
protein encoded by this DNA sequence is required in the 
nervous system. In this connection, the expression of the 
gene encoding this protein is increased in the nervous 
system. A sequence analysis showed that it is a new gene. 
Moreover, further genes could be isolated which have 
homologies to this gene (murine gene "T", figures 9 and 10; 
human gene "T2", figure 16; human gene "T3", figures 17 and 
18; murine gene T2, figures 12 and 13; murine gene T3, 
figure 19). The T gene, T2 gene and T3 gene are members of 
the T (gene) family, as shown below, and originate 
preferably from vertebrates, such as man, mouse or rat. 
Defects in these genes limit the functions of, the nervous 
system, in particular the CNS. These genes also perform an 
important function in the control of cell growth, and 
changes in these genes or their expression result in defects 
regarding the control of cell growth, e.g. also in tumor 
formation, in particular of the neuroblastoma. Small 
children up to the age of 8 are affected almost exclusively 
by this cancerous disease. The first symptoms already occur 
within the first 12 months of life in 25 to 30 percent of 
the cases. In the case of the neuroblastoma very young cells 
of the autonomous nervous system degenerate. Since these 
nerves extend along the rear side of the abdominal region 
and the chest, neuroblastomas usually occur in the regions 
of the stomach, pelvis, chest and neck. More than half the 
diseases start from the suprarenal marrow which is also 
formed by nerve cells. Symptoms which may refer in small 
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children to a neuroblastoma are nodes, swellings, bone pain, 
limping, tiredness, fever, paleness, sweating, obstinate 
or persistentcough, hematomas around the eye. A 
neuroblastoma can be diagnosed by a physician by means of 
blood tests, urine analyses and ultrasonic examinations and 
by the removal of biopsies from the tumor and an examination 
of bone marrow. As soon as the accurate location of the 
tumor is diagnosed, it is removed by means of an operation. 
However, the early formation of metastases creates a 
problem. By isolating and analyzing the T gene it is now 
possible to develop novel measures of diagnosing and 
treating the neuroblastoma. Due to this, it is possible to 
diagnose the cancerous disease early and establish forms of 
therapy promising better chances of recovery. 

Mutations in genes of the T gene family also lead to a 
disturbed development and differentiation of the nervous 
system, in particular the brain. In many cases, this results 
in mental diseases, e.g. mental retardations or Alzheimer's 
disease. The T gene also plays an important role in the 
interconnection of individual regions of the brain, e.g. 
forebrain and midbrain. Mutations in this gene lead in some 
cases to schizophrenic diseases and syndromes of autism. By 
means of the human and murine genes it is possible to draw 
important fundamental conclusions as to the development of 
the nervous system and in particular the brain. Good 
approaches offer themselves as regards the research of 
pathologic changes of the nervous system and in particular 
the brain. 

Patients can be examined more simply for possible mutations 
by means of the genomic sequences. The genomic sequences of 
the T gene are of advantage in particular when little 
(tumor) material is available for the analysis. By this it 



f 1 {| 3 «3 ;:L ES H» In o & o Esl O «■»!: 



is possible, for example, to examine even minute tumors for 
mutations in this gene. This also provides the possibility 
of checking a therapy (in particular radiation therapy 
and/or chemotherapy) for its being successful, since it is 
possible to detect tumor cells circulating in the blood by 
genomic primers which are specific to the genomic DNA using 
a PCR reaction. 

The term "hybridizing" used in the present invention relates 
to conventional hybridization conditions, preferably to 
hybridization conditions which use 5xSSPE, 1 % SDS, 
lxDenhardt f s solution as the solution and where 
hybridization temperatures are between 35°C and 70°C, 
preferably 65°C. Following hybridization, washing is 
preferably carried out using first 2xSSC, 1 % SDS and then 
0 . 2xSSC at temperatures between 35°C and 70°C, preferably of 
65°C (regarding a definition for SSPE, SSC and Denhardt 1 s 
solution see Sambrook et al., Molecular Cloning: A 
Laboratory Manual, 2 nd edition, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y. (1989)). 
Stringent hybridization conditions are particularly 
preferred, as described in Sambrook et al., supra, for 
example . 

The terms "variants" or "fragment" used in the present 
invention comprise DNA sequences which differ from the 
sequences indicated in the figures by deletion (s), 
insertion (s) , substitution (s) and/or other modifications 
known in the art or comprise a fragment of the original 
nucleic acid molecule, the protein or peptide encoded by 
these DNA sequences still having the above-mentioned 
properties. Therefore, functional equivalents, derivatives, 
precursors (bioprecursors ) are counted among them. 
Derivatives are understood to mean e.g. mutation derivatives 
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(produced by deletions or insertions, for example) , fusions, 
allel variants, muteins and splicing variants. Two select 
examples of such splicing variants are shown in figures 14 
and 15. Methods of producing the above changes in the 
nucleic acid sequence are known to a person skilled in the 
art and are described in standard works of molecular 
biology, e.g. in Sambrook et al., supra. The person skilled 
in the art is also capable of determining whether a protein 
encoded by a nucleic acid sequence modified in such a way 
still has the above-mentioned properties. 

In a preferred embodiment, the present invention relates to 
a DNA sequence which encodes a protein comprising the amino 
acid sequence of figure 1, figure 9, figure 11, figure 12, 
figure 13, figure 14, figure 15, figure 16, figure 17, 
figure 18 or figure 19, the protein having the above-defined 
biological activity. 

By lowering or inhibiting the expression of the above 
described DNA sequences it is possible to reduce or 
eliminate the synthesis of the proteins encoded by them, 
e.g. the T protein, which is desirable for certain states of 
a disease, for example. Therefore, another preferred 
embodiment of the present invention relates to antisense 
RNA, which is characterized in that it is complementary to 
the above DNA sequences and can reduce or inhibit the 
synthesis of the protein encoded by these DNA sequences and 
to a ribozyme, which is characterized in that it can bind 
specifically to part of the above DNA sequences and to the 
RNA transcribed by these DNA sequences and can cleave them 
so as to reduce or inhibit the synthesis of the protein 
encoded by these DNA sequences. These antisense RNAs and 
ribozymes are preferably complementary to a coding region of 
the mRNA. Based on the disclosed DNA sequences, the person 




skilled in the art can produce and use suitable antisense 
RNAs . Suitable methods are described in EP-Bl 0 223 399 or 
EP-A1 0 458, for example. Ribozymes are RNA enzymes and 
consist of a single RNA strand. They can cleave 
intermolecularly other RNAs , e.g. the mRNAs transcribed by 
the DNA sequences according to the invention. These 
ribozymes must, in principle, have two domains: (1) a 
catalytic domain and (2) a domain which is complementary to 
the target RNA and can bind thereto, which is a precondition 
for a cleavage of the target RNA. Based on the methods 
described in the literature, it is meanwhile possible to 
construct specific ribozymes which excise a desired RNA at a 
certain pre-select site (see e.g. Tanner et al., in: 
Antisense Research and Applications, CRC Press, Inc. (1993), 
415-426) . 



The DNA sequences according to the invention or the DNAs 
encoding the above described antisense RNAs or ribozymes may 
also be inserted in a vector or expression vector. Thus, the 
present invention also comprises vectors or expression 
vectors containing these DNA sequences. The term "vector" 
relates to a plasmid (e.g. pUC18, pBR322, pBlueScript) , to a 
virus or another suitable vehicle. In a preferred 
embodiment, the DNA molecule according to the invention is 
functionally linked in the vector to regulatory elements 
allowing the expression thereof in prokaryotic or eukaryotic 
host cells. Along with the regulatory elements, e.g. a 
promoter, such vectors contain typically a replication 
origin and specific genes which allow the phenotypic 
selection of a transformed host cell. The lac,trp promoter 
or the T7 promoter are counted among the regulatory elements 
for the expression in prokaryotes, e.g. E. coll, those for 
the expression in eukaryotes comprise the AOX1 or GAL1 
promoter in yeast, and those for the expression in animal 
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cells include the CMV, SV40, RVS40 promoter, CMV or SV40 
enhancer. Further examples of suitable promoters are the 
metallothionein I promoter and the polyhedrin promoter. In a 
preferred embodiment the vector contains the promoter of the 
human T gene or an ortholog of the T gene. Suitable 
expression vectors for E. coll are e.g. pGEMEX, pUC 
derivatives, pGEX-2T, pET3b and pQE-8, the latter being 
preferred. Suitable vectors for the expression in yeast 
comprise pYlOO and Ycpadl, and suitable vectors for the 
expression in mammalian cells include pMSXND, pKCR, pEFBOS, 
cDM8 and pCEV4 . Vectors derived from baculovirus for 
expression in insect cells, e.g. pAcSGHisNT-A, are also 
counted among the expression vectors according to the 
invention . 

General methods known in the art can be used for 
constructing expression vectors which contain the DNA 
sequences according to the invention and suitable control 
sequences. These methods comprise e.g. In vitro 
recombination techniques, synthetic methods, and In vivo 
recombination techniques, as described in Sambrook et al., 
supra, for example. The DNA sequences according to the 
invention can also be inserted in combination with a DNA 
coding for another protein or peptide, so that the DNA 
sequences according to the invention can be expressed in the 
form of a fusion protein, for example. These other DNAs are 
preferably reporter sequences which code for a reporter 
molecule comprising a detectable protein, e.g. a stain or 
coloring matter, an antibiotic resistance, ft-galactosidase 
or a substances detectable by spectrophotometric, 
spectrof luorometric, luminescent or radioactive assays. 

The present invention also relates to host cells containing 
the above described vectors. These host cells comprise 
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bacteria (e.g. the E. coli strains HB101, DH1 , Kill 6 r JM101, 
JM109, BL21 and SG13009) , fungi, e.g. yeasts, preferably S. 
cerevisiae, plant cells, insect cells, preferably sf9 cells, 
and animal cells, preferably cells from vertebrates or 
mammals. Preferred mammalian cells are CHO, VERO, BHK, HeLa, 
COS, MDCK, 293 or WI38 cells. Methods of transforming these 
host cells for the phenotypic selection of transf ormant s and 
for the expression of the DNA molecules according to the 
invention using the above-described vectors are known in the 
art . 

The genes belonging to the sequences according to the 
invention can be amplified by suitable primer sequences. The 
primer sequences indicated in figure 20 are particularly 
suited for amplification of genes T2 and T3. 

The present invention also relates to the proteins encoded 
by the DNA sequences according to the invention and to 
methods of producing the protein encoded by the DNA 
sequences according to the invention. The person skilled in 
the art is familiar with conditions of culturing transformed 
or transf ected host cells. The method according to the 
invention comprises the culturing of the above described 
host cells under conditions which allow the expression of 
the protein (or fusion protein) (preferably stable 
expression) and the collection of the protein from the 
culture or from the host cells. Suitable purification 
methods (e.g. preparative chromatography, affinity 
chromatography, e.g. immunoaf f initychromatography , HPLC, 
etc.) are generally known. 

The proteins according to the invention preferably comprise 
the amino acid sequences shown in figure 1, figure 9, figure 
11, figure 12, figure 13, figure 14, figure 15, figure 16, 
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figure 17, figure 18 or figure 19 or represent fusions, 
fragments, derivatives or precursors (bioprecursors ) 
thereof, the above mentioned properties being maintained 
within the meaning of functional equivalents. As to the 
definitions of these terms, reference is made to the 
respective explanations above. Derivatives are understood to 
mean in particular the changed proteins or peptides which 
differ from the sequences shown in the figures by 
conservative amino acid substitutions or contain non- 
conserved amino acid substitutions that do not change the 
function of the T proteins to a substantial degree. 

The following amino acid motives have been identified by 
Inventors. They are suited to identify formerly unknown 
proteins which belong to the T/T2/T3 family according to the 
invention and a protein superf amily from pore membrane 
proteins and filament-binding proteins. 
Motive 1: 

(A,T) (I,P,V) (L,T) (G, A, Q) ( L , V ) XXX ( L , V) 

Motive 2: 
IYTDWAN 
Motive 3: 

AXXXXXXXXXGXXXXXXAXXXXXXXXXXXXXXXXXXQ 
Motive 4: 

SXXXXDX (12, 20) KX (17, 22 ) AXXXXXXXXL 

Motive 5: 

I YT DWANXXLX ( K , R ) 

Motive 6: 

KX ( 1 8 , 2 1 ) AXXXXXXXXLX ( 1 5 , 2 4 ) S 
Motive 7: 

NX ( 3 , 1 1 ) SXXXAXXXXXXXL 

Explanation : X stands for every amino acid 

(A, T ) means amino acid A or T at this site 
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X(2,4) denotes two to four Xs at this site 

Another preferred embodiment of the present invention 
relates to antibodies against the above described proteins 
according to the invention or to a fragment thereof. These 
antibodies may be monoclonal, polyclonal or synthetic 
antibodies or fragments thereof. In this connection, the 
term "fragment" means all parts of the monoclonal antibody 
(e.g. Fab, Fv or "single chain Fv" fragments) which have an 
epitope specificity the same as that of the complete 
antibody. The person skilled in the art is familiar with the 
production of such fragments. 

The antibodies according to the invention are preferably 
monoclonal antibodies. The antibodies according to the 
invention can be produced according to standard methods, the 
protein encoded by the DNA sequences according to the 
invention or a synthetic fragment thereof serving as an 
immunogene. Methods of obtaining monoclonal antibodies are 
known to the person skilled in the art and comprise e.g. as 
a first step the production of polyclonal antibodies using 
the proteins according to the invention or fragments thereof 
(synthetic peptides, for example) as an immunogene for 
immunizing suitable animals, e.g. rabbits or chickens, and 
the collection of the polyclonal antibodies from the serum 
or egg yolk. 

For example, cell hybrids from cells producing antibodies 
and tumor cells from bone marrow are then produced and 
cloned. Thereafter, a clone is selected which produces an 
antibody specific to the antigen used. This antibody is then 
produced. Examples of cells producing antibodies are spleen 
cells, lymph node cells, B lymphocytes, etc. Examples of 
animals which can be immunized for this purpose are mice, 



rats, horses, goats and rabbits. The myeloma cells can be 
obtained from mice, rats, humans or other sources. The cell 
fusion can be carried out by the generally known method 
developed by Kohler and Milstein, for example. The 
hybridomas obtained by cell fusion are screened using the 
antigen according to the enzyme-antibody method or according 
to a similar method. Clones are obtained with the boundary 
dilution method, for example. The resulting clones are 
implanted intraperitoneally into BALB/c mice, for example, 
the mouse ascites is removed after 10 to 14 days, and the 
monoclonal antibody is purified by known methods (e.g. 
ammonium sulfate fractionation, PEG fractionation, ion 
exchange chromatography, gel chromatography or affinity 
chromatography) . 



In a particularly preferred embodiment, said monoclonal 
antibody is an antibody originating from an animal (e.g. 
mouse) , a humanized antibody or a chimeric antibody or a 
fragment thereof. Chimeric antibodies similar to human 
antibodies or humanized antibodies have a reduced potential 
antigenicity, however, their affinity is not lowered over 
the target. The production of chimeric and humanized 
antibodies or of antibodies similar to human antibodies has 
been described in detail (see e.g. Queen et al., Proc. Natl. 
Acad. Sci., U.S.A. 86 (1989), 10029, and Verhoeyan et al., 
Science, 239 (1988), 1534). Humanized immunoglobulins have 
variable framework regions which originate substantially 
from a human immunoglobulin (designated acceptor 
immunoglobulin) and the complementarity of the determining 
regions which originate substantially from a non-human 
immunoglobulin (e.g. from a mouse) (designated donor 
immunoglobulin). The constant region(s) originate (s) , if 
available, also substantially from a human immunoglobulin. 
When administered to human patients, humanized (and the 
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human) antibodies have a number of advantages over 
antibodies from mice or other species: (a) the human immune 
system should not regard the framework or the constant 
region of the humanized antibody as foreign and therefore 
the antibody response to such an injected antibody should be 
less than to that to a completely foreign mouse antibody of 
a partially foreign chimeric antibody; (b) since the 
effector region of the humanized antibody is human, it might 
interact better with other parts of the- human immune system, 
and (c) injected humanized antibodies have a half life which 
is substantially equivalent to that of human antibodies 
occurring in nature, which permits the administration of 
doses smaller and less frequent as compared to antibodies of 
other species. 



The antibodies according to the invention can be used for 
the immunoprecipitation of the above discussed proteins, for 
the isolation of related proteins from cDNA expression 
libraries or for the below indicated purposes 
(diagnosis/therapy), for example. 



The present invention also relates to a hybridoma which 
produces the above described monoclonal antibody. 



In a preferred embodiment, the present invention relates to 
antibodies against the peptides of genes T2 and T3 listed 
separately (cf. figure 20) . 

It has been found that the below peptide can be used 
specifically for generating antibodies against the T 
protein. The amino acid sequence of the suitable peptide 
reads as follows: 



EKGEDPETRRMRTVKNIAD 
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The present invention makes possible to study disturbances 
in the development and function of the nervous system on a 
genetic level. These disturbances comprise inter alia 
neurological and psychiatric diseases {inter alia 
Alzheimer's disease, Parkinson's disease, schizophrenia, 
manic-depressive diseases, autism, mental retardations) , 
injuries of the nervous system, innate damage of the nervous 
system or degenerative diseases of the- nervous system. The 
invention also enables the treatment of cancer, inter alia 
of tumors of the nervous system, such as neuroblastoma, 
astrocytoma, glioblastoma, medulloblastoma . This diagnosis 
cannot only be made postnatally but already prenatally. It 
can be detected by means of the DNA sequence according to 
the invention or probes or primers derived therefrom whether 
mammals, in particular humans, contain a gene which codes 
for and/or expresses the protein according to the invention 
or whether this gene results in a mutated form of the 
protein which is no longer biologically active. For this 
purpose, the person skilled in the art can carry out common 
methods, such as reverse transcription, PCR, LCR, 
hybridization and sequencing. The antibodies according to 
the invention are also suited e.g. for diagnosis, i.e. for 
detecting in a sample the presence and/or concentration of 
the protein according to the invention, a shortened or 
extended form of the protein, etc. The antibodies can be 
bound e.g. in immunoassays in liquid phase or to a solid 
carrier. In this case, the antibodies can be labeled in 
various ways. Suitable markers and labeling methods are 
known in the art. Examples of immunoassays are ELISA and 
RIA. 

Thus, the present invention also relates to a diagnostic 
method for detecting a disturbed expression of the protein 




according to the invention or for detecting a changed form 
of this protein, in which a sample is contacted with the DNA 
sequences according to the invention or the antibody 
according to the invention or the fragment thereof and then 
it is determined directly or indirectly whether the 
concentration of the protein and/or its amino acid sequence 
differs from a protein obtained from a healthy patient. 

The present invention also allows to carry out therapeutic 
measures in connection with the above discussed 
disturbances, i.e. the above described inventive DNA 
sequences, antisense RNAs, ribozymes and antibodies can also 
be used for producing a medicament, e.g. for controlling the 
expression of the protein according to the invention, or for 
exchanging a mutated form of the gene by a functional form 
and thus also for the production of a medicament for 
preventing or treating diseases of the nervous system, in 
particular tumoral diseases of the CNS . For example, the 
protein according to the invention can be introduced into 
mammals, in particular humans, by common measures. For this 
purpose, it may be favorable to link the protein to a 
protein which is not considered foreign by the respective 
body, e.g. transferrin or bovine serum albumin (BSA) . An 
inventive DNA sequence, antisense RNA or ribozyme can also 
be introduced into mammals, in particular humans, and 
expressed. By means of an antibody according to the 
invention it is possible to control and regulate the 
expression of the protein (TP) according to the invention or 
the related proteins. 

Thus, the present invention also relates to a medicament 
which contains the above described DNA sequences, antisense 
RNA, the ribozyme, the expression vector, the protein 
according to the invention or the antibody or the fragment 
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thereof. This medicament contains, optionally in addition, a 
pharmaceutically compatible carrier. Suitable carriers and 
the formulation of such medicaments are known to the person 
skilled in the art. Suitable carriers are e.g. phosphate- 
buffered common salt solutions, water, emulsions , e.g. oil- 
in-water emulsions , wetting agents, sterile solutions , etc. 
The medicaments can be administered orally or parenterally . 
The topical, intra-arterial , intra-muscular , subcutaneous, 
intramedullary, int rathe ka 1 , intraventricular, intravenous , 
intraperitoneal or intranasal administration are counted 
among the methods for the parenteral administration. The 
suitable dose is determined by the attending physician and 
depends on various factors, e.g. on the age, sex and weight 
of the patient, the stage of the disease, the kind of 
administration, etc. 

The above described nucleic acids are preferably inserted in 
a vector suitable for gene therapy and introduced into the 
cells under the control of a tissue-specific . vector, for 
example. In a preferred embodiment, the vector containing 
the above described nucleic acids is a virus, e.g. an 
adenovirus, vaccinia virus or adenovirus. Retroviruses are 
particularly preferred. Examples of suitable retroviruses 
are MoMuLV, HaMuSV, MuMTV, RSV or GaLV . For the purposes of 
gene therapy, the nucleic acids according to the invention 
can also be transported to the target cells in the form of 
colloidal dispersions. They comprise liposomes or 
lipoplexes, for example (Mannino et al., Biotechniques 6 
(1988) , 682) . 

Finally the present invention relates to a diagnostic kit 
for carrying out the above described diagnostic method, 
which contains a DNA sequence according to the invention or 
the above described antibody according to the invention or a 
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fragment thereof. Depending on the kind of the kit, the DNA 
se q uence or the antibody or the fragment thereof can be 
immobilized . 

Sequences of the T genes can be applied to nylon membranes 
or glass carriers and hybridized with complex cDNA samples 
from tumors and pertinent normal tissues or diseased and 
pertinent healthy tissue. This enables the (fully automated) 
detection of the expression of these genes. The sequences 
used for this purpose can be e.g. the entire cDNA sequence 
or short sequence segments, e.g. 10-15 bp oligomers (see 
inter alia figure 20). Having determined the expression of 
the T genes, the therapy, inter alia the cancer therapy, 
can be selected deliberately according to the respective 
individual situation of the patient or can be adapted 
thereto. Genes whose changed expression influence already 
now the treatment of the patient are the N-myc gene in the 
case of neuroblastoma, for example. By detecting the 
expression of the T genes it is thus possible to adapt the 
treatment very quickly and efficiently to the respective 
requirements and in this way it contributes essentially to 
the improved therapy. 

The isolation and characterization of the human gene 
according to the invention and in particular of the mouse 
homologues thereof also allow to establish an animal model, 
which is very valuable for the further study of diseases of 
the nervous system and of cancerous diseases on a molecular 
level. The subject matter of the present invention thus also 
relates to a non-human mammal whose T gene or T2 or T3 gene 
is changed, e.g. by inserting a heterologous sequence, in 
particular a selection marker sequence. 
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The expression "non-human mammal" comprises any mammal whose 
T gene or T2 or T3 gene can be changed. Examples of such 
mammals are mouse, rat, rabbit, horse, cattle, sheep, goat, 
monkey or ape, pig, dog and cat, with mouse being preferred. 



The expression "T gene or T2 or T3 gene which is changed" 
signifies that a change of the gene structure or the gene 
sequence is carried out by standard methods in the 
corresponding gene occurring naturally in the non-human 
mammal. This can be achieved inter alia by introducing a 
deletion of about 1-2 kb, at the place of which a 
heterologous sequence, e.g. a construct for mediating 
antibiotic resistance (e.g. a "neo cassette") is introduced. 
Heterologous sequences allowing to carry out time-specific 
and tissue-specific deletions in vivo can also be inserted 
in the T gene. Furthermore, heterologous sequences allowing 
to track the expression of the T gene in vivo can be 
introduced into the T gene. This can be done inter alia by 
inserting a sequence coding for the GFP (green fluorescent 
protein) protein inside an exon or as an independent exon . 
These methods are generally described by Schwartzberg et 
al., Proc. Natl. Acad. Sci., U.S.A., Vol. 87, pages 3210- 
3214, 1990, to which reference is made herein. 



In particular, the modification can be described and carried 
out as follows. Figure 9 represents part of the cDNA 
sequence of the T gene of a mouse. Illustration 10 shows an 
intron sequence of the T gene of a mouse, which is flanked 
by two exons . These murine sequences can then be used for 
the deliberate change of the murine T gene. For example, the 
splicing sequences of the intron can be deleted or changed 
such that the T gene is no longer spliced correctly. By 
incorporating a splicing acceptor sequence of another exon 
of the murine T gene into the intron sequence it is possible 
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to insert in this intron a sequence which is recognized as 
exon and is spliced to the T gene exon upstream thereof. 
This inserted sequence may be an exon, for example, which 
encodes the EGFP protein (EnhancedGreenFluorescentProtein) . 
As a result, the original murine T gene becomes a fusion 
protein comprising the EGFP protein. Thus, a mouse can 
preferably be generated, which allows to track the 
expression of the T gene in vivo. The inserted sequence can 
be designed at its end (e.g. PolyA signal, splicing signals, 
etc.) such that no further exons of the T gene are spliced 
to the inserted exon or the spliced exon can no longer be 
translated. As a result, a deletion of the murine T protein 
forms on the C-terminal end or a premature discontinuance of 
the reading frame, and an (at least partial) inactivation of 
the protein function of the murine T gene can be achieved. 
It is also possible to insert, as new exon sequences, 
sequences which yield an mRNA sequence where this new mRNA 
sequence is localized at the 3' end. By suitable sequences 
it is then possible to achieve a change in the stability - of 
the mRNA or a changed localization in the cell. The 
accompanying phenotypes of the thus modified mice can then 
result in important conclusions drawn on the function of the 
X gene. These mice can then also be used for detecting new 
active substances compensating the functional loss of the T 
gene . 

In another preferred embodiment, the sequence of figure 13 
is used for the production of a knock-out mouse. Figure 13 
describes a murine sequence of gene T2 . The elimination of 
the murine T2 genes can in this connection be achieved in 
different ways. For example, the splicing sequence (GT, 
underlined in figure 13) can be deleted or changed such that 
the T2 gene is no longer spliced correctly. In addition, by 
incorporating a splicing acceptor sequence of another exon 




of the murine T2 gene into the following intron sequence it 
is possible to insert in this intron a sequence which is 
detected as exon and spliced to the T2 gene exons upstream 
thereof. This inserted exon may be e.g. an exon which codes 
for the EGFP protein. Due to this, the original murine T2 
gene becomes a fusion protein which carries the EGFP protein 
at the C terminus. In this way, a mouse can be generated 
which allows to track the expression of the T2 gene in vivo. 
The inserted sequence can be designed at its end (e.g. PolyA 
signal, etc.) such that no further exons are spliced to the 
inserted exon by the T2 gene. A deletion of the murine T2 
protein forms at the C-terminal end and an (at least 
partial) inactivation of the protein function of the murine 
T2 gene can be achieved. Such sequences can also be inserted 
as new exon sequences which yield an mRNA sequence in which 
at the 3 1 end this new mRNA sequence is localized. By means 
of suitable sequences it is then possible to achieve a 
change in the stability of the mRNA or a changed 
localization in the cell. The accompanying phenotypes of the 
thus changed mice can then lead to important conclusions as 
to the function of the T2 gene. These mice can also be used 
for detecting new active substances which compensate the 
functional loss of the T gene. 

Furthermore, a mammal can be generated comprising a change 
in the T3 gene. The sequence in figure 19 represents part of 
the murine cDNA sequence of the T3 gene. Deliberate changes 
in the T3 gene of a mouse can be achieved by deletions or 
insertions. The inserted sequence can be an exon, for 
example, which codes for the EGFP protein. As a result, the 
original murine T3 gene becomes a fusion protein which 
carries the EGFP protein at the C terminus. Thus, a mouse 
can be generated which allows to track the expression of the 
T3 gene in vivo. The inserted sequence can be designed at 



O H ii »Q! ,:l «4>5 »+ ,r 9 m O 6. O S O El 

21 

its end (e.g. PolyA signal, etc.) such that no further exons 
are spliced from the T3 gene to the inserted exon. A 
deletion of the murine T3 protein thus forms on the C- 
terminal end and an (at least partial) inactivation of the 
protein function of the murine T3 gene can be achieved. It 
is also possible to insert, as new exon sequences, sequences 
which yield an mRNA sequence where this new mRNA sequence is 
localized at the 3 f end. By suitable sequences it is then 
possible to achieve a change in the stability of the mRNA or 
a changed localization in the cell. The accompanying 
phenotypes of the mice changed in this way can then lead to 
important conclusions as to the function of the T3 gene. 
These mice can then also be used for discovering new active 
substances which compensate the functional loss of the T3 
gene . 

Another subject matter of the present invention are cells 
which are obtained from the above non-human mammal. These 
cells can be present in any form, e.g. in a primary or long- 
term culture. 

A non-human mammal according to the invention can be 
provided by common methods. A method is favorable which 
comprises the steps of: 

(a) producing a DNA fragment, in particular a vector, 
containing a changed T, T2 or G3 gene, the gene having 
been modified by inserting a heterologous sequence, in 
particular a selectable marker; 

(b) preparing embryonal stem cells from a non-human mammal 
(preferably a mouse) ; 
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(c) transforming the embryonal stem cells from step (b) 
with the DNA fragment from step (a) , the T gene in the 
embryonal stem cells being changed by homologous 
recombination with the DNA fragment from (a) ; 

(d) culturing the cells from step (c) ; 

(e) selecting the cultured cells from step (d) for the 
presence of the heterologous sequence, in particular 
the selectable marker; 

(f) producing chimeric non-human mammals from the cells 
from step (e) by injecting these cells into mammalian 
blastocysts (preferably mouse blastocysts) , 
transferring the blastocysts to pseudo-pregnant female 
mammals (preferably mouse) and analyzing the resulting 
offspring for a change of the T gene. 

In step (c) , the mechanism of homologous recombination (cf . 
R.M. Torres, R. Kuhn, Laboratory Protocols for Conditional 
Gene Targeting, Oxford University Press, 1997) is used to 
transfect embryonal stem cells. The homologous recombination 
between the DNA sequences present in a chromosome and new 
added cloned DNA sequences enable the insertion of a cloned 
gene in the genome of a living cell in place of the original 
gene. Using embryonal germ cells, animals which are 
homozygous for the desired gene or the desired gene portion 
or the desired mutation can be obtained via chimeras by this 
method. 

The expression "embryonal stem cells" comprises any 
embryonal stem cells of a non-human mammal, suited to mutate 
the T gene. The embryonal stem cells originate preferably 
from a mouse, in particular the cells E14/1 or 129/SV. 
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The expression "vector" comprises any vector which by 
recombination with the DNA of embryonal stem cells enables a 
change of the T, T2 or G3 gene. The vector preferably has a 
marker by means of which selection can be made for existing 
stem cells in which the desired recombination has been made. 
Such a marker is the loxP/tk neo cassette, for example, 
which can be removed by means of the Cre/loxP system from 
the genome again. 



The person skilled in the art also knows conditions and 
materials serving for carrying out steps (a) - (f ) . 

By means of the present invention a non-human mammal is 
provided whose T, T2 or T3 gene is changed. This change may 
be an elimination of the gene expression-regulating 
function. Using such a mammal or cells therefrom it is 
possible to study selectively the gene expression- 
controlling function of the TP protein. It is also possible 
by this to find substances, medicaments and therapy 
approaches by which selective influence on the controlled 
function is possible. The present invention therefore 
provides a basis for influencing the most differing 
diseases. Such diseases are e.g. limitations of the CNS 
functions which cover mental retardations or the induction 
of cancer resulting from deficiencies in the control of cell 
proliferation. 



Inventors found out in the sequence analysis that the T2 
gene in the coding region of the cDNA sequence contains CGG 
trinucleotides which are known to be sensitive to 
methylation. The T2 gene thus has in the coding region (N- 
terminal region of the protein which has no homology to the 
T protein or T3 protein) a methylat ion-sensitive and 
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unstable sequence which results in the failure of the gene 
accompanied by a mental retardation and uncontrolled cell 
growth, such as cancer. 

All the three genes of the T family play a major role in the 
case of tumors. The T gene is affected in many tumors by 
genomic rearrangements. For example, in neuroblastomas 
genomic changes in the DNA of tumors can be found as 
compared to the DNA of the accompanying healthy tissue. The 
expression of the T gene, e.g. in tumors of the brain, is 
also changed. A strongly changed expression can be found 
inter alia in the advanced stages of glioblastomas. Tumor- 
specific changes of the expression of the T gene and the 
occurrence of the T protein can also be detected in 
meningiomas . 



In many tumoral diseases, the T2 gene also undergoes genomic 
rearrangements, and a changed expression can be detected in 
tumors. For example, in melanomas and lung tumors genomic 
rearrangements of the T2 gene can be detected. Expression 
differences are also detectable in gliomas, glioblastomas, 
astrocytomas and PNETs (Primitive Neuro-Ectodermal Tumors), 
for example . 

In many tumors, the T3 gene also undergoes genomic 
rearrangements and expression changes. Rearrangements can be 
detected in colon carcinomas, for example. Expression 
differences are detectable inter alia in gliomas, 
glioblastomas, astrocytomas and PNETs (Primitive Neuro- 
Ectodermal Tumors) . 



By isolating and accurately analyzing 
now have found that the T protein has 
to proteins which perform completely 



the T gene, Inventors 
a certain relationship 
different functions in 
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the cell. The sequence analysis of these proteins showed 
that the genes coding for these proteins are likely due to a 
common precursor gene or to similar precursor genes. 
Proteins such as the POM121 protein (Hallberg et al., J. 
Cell Biol. 122, pages 513-522, 1993) belong to this 
superfamily. It is one of two known nuclear pore membrane 
proteins in vertebrates. The CLIP-170 protein which binds 
vesicles and other organelles within the cell to microtubuli 
(Pierre et al., Cell 70, pages 887-900, 1992) also belongs 
to this family. The unexpected discovery that genes which 
perform such different tasks inside the cell belong to a 
common protein superfamily is extremely surprising and even 
inconsistent at first sight. However, when the functions of 
the individual genes are analyzed, two main functions of 
these proteins can be derived. The CLIP-170 protein binds to 
microtubuli, the newly isolated T proteins and the POM121 
protein are localized in the nuclear core complex. Due to 
the properties of these proteins, Inventors propose that 
this protein superfamily be referred to as POMIC protein 
superfamily. POMIC shall, in this connection, stand for 
pores and/or mic rotubuli -binding protein. Based on the 
isolation and analysis of the T gene, two paralogs of the T 
gene could be isolated, namely the T2 and T3 genes which are 
described in more detail above. As regards evolution and 
function, the family of the T proteins is between the CLIP 
(cytoplasmic linker protein-170) and the POM121 protein. 
This intermediate position is also supported by the sequence 
analysis and the putative protein structure. The nuclear 
pore membrane protein POM121 has no marked coiled-coil 
structure whereas the CLIP-170 protein shows a very distinct 
coiled-coil structure between the N-terminus and C-terminus 
(cf. figure 29). Coiled-coil structures exist in the family 
of T proteins, however, they are clearly less marked than in 
CLIP-170. A similar intermediate position is adopted by the 
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family of T proteins with respect to the occurrence of 
hydrophobic domains. The POM121 protein has a hydrophobic 
domain at the N-terminus which is introduced into the 
nuclear membrane, and the protein is positioned in the 
nuclear pore. The CLIP-170 protein has no distinct 
hydrophobic domain. The T protein and the T3 protein, 
however, have a hydrophobic domain with three hydrophobic 
partial regions (cf. figure 30) . The exchange of the N- 
terminus in the T2 protein as compared to the evolutionary 
basic form resulted in a loss of this distinct hydrophobic 
domain. Yet all three T proteins have in common the very 
similar structure of the C-terminus . The T3 protein is most 
similar to the T protein within the T protein family. 
However, the T3 protein also has undergone a change in the 
course of evolution. The N-terminus was changed as compared 
to the T protein by insertion of about 400 amino acids. This 
insertion resulted in another coiled-coil structure as 
compared to the otherwise very similar T protein. The T 
protein and the T3 protein perform functions in the nuclear 
membrane-localized form, which are similar to those of 
POM121. However, it is interesting that in the course of 
evolution there was a loss of part of the C-terminus in the 
POM121 protein. As compared to the P0M121 protein, the T 
proteins have a longer C-terminus. Due to this longer C- 
terminus many interactions with other proteins are possible. 
In this connection, it is also worth mentioning that a 
leucine-zipper structure was discovered in the T protein, 
which facilitates interactions with other proteins. The 
family of T protein plays an important role in the mediation 
of interactions between cell organelles and filaments, inter 
alia microtubuli. Microtubuli play an important role e.g. in 
nerve cells; in the case of axons, for example, the plus 
ends of the microtubuli face away from the cell body whereas 
the microtubuli of dendrites have both orientations. This 
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cell polarity is of major importance for the functioning of 
a cell or living being. Microtubuli also provide an 
efficient organelle transport, and they are of essential 
significance for the general organization of membrane 
structures in a cell. The T proteins perform an important 
mediator function between membrane structures and 
microtubuli. The T gene and the T3 gene perform their 
function in particular as a membrane protein in the nuclear 
pore whereas the T2 protein acts particularly as a 
cytoplasmic protein . 

Due to the findings of Inventors the T gene and the T3 gene 
are part of the nuclear pore complex. Nuclear pore complexes 
(NPCs) are extremely complicated structures which mediate 
the bi-directional transport of macromolecules between the 
nucleus and the cytoplasm. The nuclear pore complex is 
embedded in the nuclear envelope and encases a central 
channel with a structure only defined insufficiently thus 
far. Peripheral structures, short cytoplasmic filaments and 
a basket-like structure are attached on both sides of the 
central nuclear pore complex. This basket-like structure 
interacts with molecules which pass through the nuclear pore 
complex. The mechanism of synthesizing nuclear pore 
complexes is hardly understood thus far. In addition, it has 
been found when observing cells passing through mitosis that 
the nuclear envelope is dissolved deliberately and their 
components, including the nuclear pore proteins, are 
distributed over the mitotic cytoplasm. At the end of 
mitosis, all these components are used again to form the 
nuclear envelope of the daughter cells. Due to the detailed 
analysis of the gene T, Inventors found that the N-terminal 
half of the T protein is weakly homologous to the pore 
membrane protein P0M121. The homology covers the entire 
region of the P0M121 protein and has an identity of about 18 
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% on a protein level so that the DNAs underlying these 
proteins should not hybridize with one another, even under 
hardly stringent conditions. As regards the formation and 
structure of the nuclear pore, the T protein according to 
the invention plays a very fundamental role. In a detailed 
analysis of the protein, a lipophilic domain could be 
detected at the N-terminus of the T protein. However, this 
sequence has no homology to the lipophilic sequence of the 
P0M121 protein. There is also a short segment of amino acids 
which might serve as a signal sequence before the lipophilic 
domain in the T protein. In order to find out whether this 
putative signal sequence and the lipophilic domain are 
involved in vivo in the incorporation into the nuclear 
membrane, various constructs of the T gene were produced. 
Various parts of the N-terminus of the T protein were fused 
with the EnhancedGreenFluorescentProtein (EGFP) . The EGFP 
was here fused to the C-terminus of the T protein. The 
fusion protein which comprised the unchanged N-terminus of 
the T protein (putative signal sequence with lipophilic 
membrane domain) was actually incorporated into the nuclear 
membrane. However, the fusion construct from which the 
putative signal sequence and the lipophilic domains lack, 
was not incorporated into the nuclear membrane and 
accumulated in the cytoplasm. This showed that the N- 
terminus of the T protein is necessary and suffices to 
result in a localization within the nuclear membrane. In 
order to show that the T protein is actually localized in 
the nuclear membrane, antibodies were generated against a 
peptide sequence of the T protein. Immunohistochemical 
studies of tissues of man, mouse and rat were carried out 
with these antibodies . It showed that the antibody detects a 
protein which is localized in the nuclear membrane. Since it 
is difficult to differentiate by means of a light microscope 
whether the protein is localized in the nuclear membrane or 
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the nucleus itself, further analyses were made using the 
high-resolution method of electron microscopy. By this it 
was possible to clearly show that the T protein is localized 
in the nuclear membrane. As a detection reaction a second 
antibody was used here to which the enzyme horseradish 
peroxidase was coupled and which resulted in a color 
reaction (DAB) . The stain or coloring formed can be seen in 
the electron-microscopic pictures only on the cytoplasmic 
side of the nuclear membrane. This , indicates that the 
antibody recognizes an epitope of the T protein which is 
accessible from the cytoplasmic side for the antibody. The 
analysis of the immunohistochemical sections also showed 
that the antibody recognizes very specific neurons {cf. 
figure 24). The results of the analysis of the expression on 
a protein level by means of the antibody are highly 
consistent with the results of the analysis of the RNA 
expression. The mouse ortholog of the T gene was used in the 
RNA in situ analyses. Using the human T gene cDNA clones, 
murine cDNA clones of the mouse ortholog were initially 
isolated and sequenced for this purpose. The sequence 
analysis confirmed that the isolated cDNA clones was the 
mouse ortholog. Such a murine cDNA clone of the T gene was 
then used for the RNA in situ hybridization (cf. figures 25, 
26, 27, 28) . An expression analysis of the T gene of the 
mouse was then possible by means of this technique. The 
accurate analysis of the spatial-temporal expression profile 
showed that the T gene plays a decisive role in the 
g enera tion, formation and maintenance of the nervous system 
in vertebrates. No expression can be detected during the 
early mouse embryogenesis on day 9.5 post conceptionem (pc = 
post conceptionem). On day 10.5 pc, it is possible to detect 
an expression in the ventral mesencephalon and in the 
telencephalon. In this stage there is also a strong 
expression in the connecting region of the mesencephalon and 
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telencepalon ( f orebrain-midbrain ) . An expression of the T 
gene in the telencephalon, in the ventral mesencephalon and 
in the myelencephalon can be detected on day 11.5 pc. An 
expression in neurons of the mantle zone of the developing 
brain and in the nuclei of the peripheral nerves is visible 
on day 12.5 pc. Furthermore, there is an expression in the 
myelencephalon, spinal cord and spinal ganglia. A minor 
expression is detectable in the mesencephalon and 
telencephalon. No expression is detectable e.g. in 
proliferating neurons in the subventricular layer or in the 
migrating neurons of the 'intermediate 1 zone. On day 14.5 
pc, an expression in mesenchymal tissues, e.g. around the 
vertebra or in the region of developing bones, is also 
visible. A strong expression in all parts of the brain and 
the peripheral nervous system (e.g. spinal ganglia and nerve 
fibers of the tail) can be detected on day 16.5 pc - An 
expression in differentiating neurons of the mantle zone of 
the telecephalons can also be detected. Furthermore, an 
expression in neurons of the spinal cord and the spinal 
ganglia can be detected. When the brain develops after the 
birth, an expression in the olfactory bulb, in the cerebral 
cortex and in the developing hippocampus can be detected 
above all. A minor expression is found however in the 
coliculus and the developing cerebellum. A similar 
expression pattern exists in the fully developed brain. 

Northern blots (cf. figure 23) were carried out to find out 
where the T gene or T2 or T3 gene are expressed. The T gene 
is expressed predominantly in the brain, hardly or not at 
all in the heart, lungs, placenta, liver, skeletal muscle, 
kidney or pancreas (irrespective of adult or fetal tissue) . 
However, the T2 gene is virtually not expressed in the brain 
but strongly expressed in the heart (adult and fetal) , adult 
liver, adult skeletal muscle and adult kidney. The T3 gene 
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is expressed in all tested tissues (adult and fetal heart, 
brain, liver, kidney: placenta, adult skeletal muscle, adult 
pancreas), except in fetal lungs. 

Because of the discovery of the T gene and the detailed 
analysis of this gene with the information obtained 
therefrom a basis has been created for the development of 
fully novel medicaments and medicament compound classes. The 
bi-directional transport of molecules through the nuclear 
membrane is of decisive significance for the function of 
each eukaryotic cell. The information which is stored in the 
form of DNA (chromosomes) in the nucleus is transcribed into 
mRNA. However, the information is only translated into 
protein in the cytoplasm. If the transcribed information 
(mRNA) does not reach the cytoplasm, the information will be 
lost and dramatic disturbances may occur within the cell. 
This transport is, however, no one-way street. It is 
likewise important that certain substances and proteins 
reach the nucleus so as to maintain the function of the 
cell. If a transcription factor, for example, which - like 
the other proteins - is formed in the cytoplasm does not 
reach the cell nucleus, it cannot trigger the transcription 
of the other genes. Dramatic disturbances of the events in 
the cell, which may even comprise the dying of the cell or 
the organism, are often accompanied by this. This shows 
clearly that nuclear pore proteins perform an extremely 
important function within the cell. The analysis of the T 
gene has now shown that the T protein is also incorporated 
into the nuclear membrane. It is interesting that the T 
protein is almost twice as large as the P0M121 protein, i.e. 
it has a much greater binding capacity than the P0M121 
protein. The T protein is therefore very well suited to 
isolate possible binding partners which attach to the T 
protein, in particular to the C-terminus of the T protein. 
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The tissue-specific expression of the T gene shows 
strikingly that nuclear core proteins (in particular nuclear 
pore membrane proteins) do not have to be expressed in all 
cells and at all times like 'housekeeping 1 genes. The 
predominant expression of the T gene in the nervous system 
shows that the T protein in the nervous system performs a 
very specific function. The predominant expression of the T 
gene in the nervous system can now be used for the 
development of new medicaments and new medicament compound 
classes. New substances can now be isolated by means of the 
T protein, which influence deliberately the bi-directional 
transport in nuclear pores of the ..nervous system. The 
localization of the T protein within the nuclear membrane is 
in this case of major advantage. Chemical compounds can be 
tested by means of automated tests. Many pharmaceutical 
companies have suitable screening methods in which more than 
200, 000 chemicals can be tested. For this purpose, e.g. 
reporter assays (e.g. GFP fusion proteins, colored 
substances, etc. ) can be used which show the successful 
transport of a molecule into the nucleus or into the 
cytoplasm. By this, new active substances can then be 
isolated which deliberately influence the transport of 
molecules into nuclear pores, in particular those of the 
nervous system. 



Identifying and analyzing interactions between the T 
proteins according to the invention (T, T2, T3 protein) or 
peptides or fragments thereof and possible binding partners 
which may represent active substances within the above- 
mentioned meaning, can happen e.g. with the "yeast-two- 
hybrid system" (Fields, Nature 340 , pages 245-247, (1989)). 
This system is based on the discovery that cellular 
transcription activators, such as GAL 4 or lexA from yeast, 
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can be separated into two independent functional domains. 
Both domains are usually part of a protein in the cell 
nucleus of the yeast cell, which binds to certain activating 
sequences of different target genes and regulates the 
transcription thereof- In this connection, one domain, the 
DNA binding domain (BD) , binds specifically to a certain DNA 
target sequence (upstream activating sequence) in the 
vicinity of the target promoter. The other domain, the 
activation domain (AD) , increases the transcription rate of 
the target gene by interaction with the transcription 
initiation complex which is bound to the promoter of the 
target gene. In the "yeast-two-hybrid system", this 
structure is used by the transcription factors in modified 
form. The DNA binding domain (BD) of GAL 4 or lexA is 
expressed there as fusion protein with a "bait protein or 
peptide" (here: T, T2 or T3 protein/peptide ) in yeast cells. 
This fusion also has a nuclear localization signal by which 
it is transported into the cell nucleus of the yeast. The 
bait fusion protein binds therein to a target sequence (UAS) 
which is located in the employed yeast strain in the 
vicinity of the promoters of two reporter genes (e.g. 
auxotrophic marker (HIS3) and enzymatic marker (lacZ) . By 
this a constellation results in which the bait protein or 
peptide is exposed in direct spatial vicinity of the 
reporter gene promoter. Then, a second fusion protein is 
additionally expressed in the same yeast cell. It consists 
of the activation domain (AD) of GAL 4 or lexA and a prey 
protein or peptide. It also has a nuclear localization 
signal. The prey fusion protein is thus also transported 
into the cell nucleus of the yeast. If the prey protein and 
the bait protein exposed on the UAS physically interact with 
each other, it becomes more likely statistically that the 
activation domain is located in the vicinity of the reporter 
gene promoter. This results in an increase of the 
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transcription of the reporter genes whose extent is 
proportional to the strength of interaction between bait and 
prey protein. In this case, e.g. a cDNA library and also a 
combinatorial peptide library are in consideration as the 
prey proteins . 



The present invention also relates to a process of 
identifying inhibitors or enhancers of the T protein family 
according to the invention. For this purpose, the nucleic 
acid sequences or parts of these sequences, which are part 
of the T gene or the paralogs or orthologs thereof, are 
inserted in suitable vectors and used for transfecting or 
transforming cells, tissues or organisms. These changed 
cells, tissues or organisms are then used for identifying 
inhibitors or enhancers of the T protein or its paralog or 
ortholog proteins (e.g. T2 and T3) or proteins which 
interact directly or indirectly with these proteins. The 
inhibitors or enhancers identified by this approach can be 
used for pharmaceutical active substances or medicaments or 
for the production thereof and for the treatment of diseases 
such as cancer, neurological and psychiatric diseases and 
injuries of the nervous system. In the case of injuries of 
the nervous system, innate damage of the nervous system or 
the degenerative diseases of the nervous system, it is 
possible to support deliberately by this treatment inter 
alia the neuronal regeneration or improve the 
interconnection of individual nervous regions (used for 
inter alia Alzheimer's disease, Parkinson 1 s disease, 
schizophrenia, manic-depressive diseases, autism, mental 
retardation) . The present invention provides the possibility 
of testing the substances or therapeutic agents suitable to 
enhance or reduce the effect of the T protein or the family 
of the T proteins. In particular, the changed nuclear pore 
properties which are influenced by the proteins T and T3 can 
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be detected by suitable screening methods. The latter 
include e.g. visualization of the bi-directional transport 
through the nuclear pore or the detection of a modified 
transcription of cellular or reporter genes. Substances or 
therapeutic agents can also be identified which inhibit or 
promote the effect of proteins which are directly or 
indirectly involved in the effect of the T protein or the 
family of the T proteins. Substances or therapeutic agents 
which show an enhancement or reduction of the effect of the 
T protein (or T2 or T3) in the above-mentioned screening 
methods, can be used to determine whether the enhancement or 
the reduction of the effect of the T protein results in 
therapeutically desired effects. Above all the inhibition of 
the growth or spreading of tumor cells or the support of 
neuronal regeneration, e.g. after injuries of the nerves 
(inter alia paraplegia and head-brain trauma), are counted 
thereamong. The identified substances can then be used as 
medicaments or for the production of these medicaments. Due 
to these medicaments it is then possible to inhibit or block 
spreading of the disease-inducing cells and thus control or 
clear up the disease on the whole. An important application 
of these medicaments is inter alia preventing the growth and 
spreading of tumor cells. In addition thereto, the 
identified active substances are used as medicaments which 
stimulate deliberately the growth of certain cells. By this 
it is then possible to regenerate cells or structures of the 
nervous system damaged by injury or degenerative processes. 
The T protein (or T2 or T3) can also be used in screening 
methods allowing not only to detect the changed nuclear pore 
properties but also to identify prior or subsequent or 
parallel signal cascades. By this it is possible to identify 
e.g. tyrosine kinases or tyrosine phosphatases which 
regulate proteins which in turn influence directly or 
indirectly the action of the T protein (or T2 or T3) . As a 
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result, suitable targets for the positive influence of the 
events in the cells can be recognized and characterized. 
Furthermore, the T protein, although it occurs as a nuclear 
pore protein, is significant for the interactions with 
filaments of the cell, e.g. microtubuli and actin. These 
interactions can now be studied, e.g. by fusion proteins of 
the T protein with the EGFP protein. Cells which were stably 
or transiently transformed or transfected with constructs 
for such fusion-reporter proteins, can be incubated with 
substances or pharmaceutical preparations to identify 
substances which enhance or reduce the interaction of the T 
protein with filaments such as the actin filaments or the 
microtubuili . As a result, it is possible to isolate active 
substances which positively influence inter alia the growth 
of nerve cells or the inhibition of the growth of tumor 
cells. For example, immunoprecipitation has to be mentioned 
as a method of identifying such possible active substances. 
Proteins can be isolated by this which bind to the T protein 
family. Further immunoprecipitations can then be carried out 
with these proteins to isolate new proteins which then no 
longer interact directly with the T protein. 

The present invention also relates to a method of 
identifying further proteins which play a role in the 
development and function of the nervous system and/or are a 
nuclear pore protein, the method comprising the steps of: 

(a) producing an antibody against a protein of the T family 
(T, T2 or T3 protein) , 

(b) contacting a cell extract with the antibody and 
identifying the antibody/protein complex, 
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(c) analyzing the complex to identify a protein which has 
bound to the protein of the complex and is no antibody, 
and 



(d) optionally repeating steps (a) to (c) to identify 
further proteins of this function. 



The invention is described in more detail by means of the 
figures, which show: 

Figure 1: human cDNA sequence (gene T) and derived amino 
acid sequence 

Figure 2: human genomic DNA sequence (gene T) 

Figure 3: human genomic DNA sequence (gene T) 

Figure 4: human genomic DNA sequence (gene T) 

Figure 5: human genomic DNA sequence (gene T) 

Figure 6: human genomic DNA sequence (gene T) 

Figure 7: human genomic DNA sequence (gene T) 

Figure 8: human genomic DNA sequence (gene T) 

Figure 9: partial murine cDNA sequence (gene T) and derived 
amino acid sequence 



Figure 10: partial murine genomic DNA sequence (gene T) 

Figure 11: partial human cDNA sequence (gene T2 ) and derived 
amino acid sequence 
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Figure 12: partial murine cDNA sequence (gene T2) and 
derived amino acid sequence 

Figure 13: partial murine cDNA sequence (gene T2) and 
derived amino acid sequence 

Figure 14: splicing variant of the human T gene with derived 
amino acid sequence 

Figure 15: splicing variant of the human T gene with derived 
amino acid sequence 

Figure 16: partial human cDNA sequence (gene T2) with 
derived amino acid sequence 

Figure 17: partial human cDNA sequence (gene T3; protein 
isoform 1) with derived amino acid sequence 

Figure 18: partial human cDNA sequence (gene T3; protein 
isoform 2) with derived amino acid sequence 

Figure 19: partial murine cDNA sequence with derived amino 
acid sequence (gene T3) 

Figure 20: oligonucleotide and peptides (T gene) 

Figure 21: sequence comparison within the T family 

Figure 22: protein alignment of POM121 protein and T protein 



Figure 23: Northern blot analysis 
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Figure 24: immunohistochemical studies and electron- 
microscopic pictures 

Figure 25: in situ hybridization with embryonal RNA 

Figure 26: in situ hybridization with RNA from brain 

Figure 27: in situ hybridization with RNA from fetal brain 

Figure 28: in situ hybridization with RNA from nerve tissues 
of mouse 

Figure 29: comparison of the coiled-coil regions between 
CLIP protein, T protein and P0M121 

Figure 30: hydrophobicity blot for POM121, T protein and T3 
protein . 

The following clones were deposited with the DSMZ (Deutsche 
Sammlung fur Mikroorganismen and Zellkulturen GmbH) [German- 
type collection of microorganisms and cell cultures], 
Mascheroder Weg lb, Braunschweig, according to the Budapest 
treaty on August 18, 1998: 

clone JFC277 ( DSM1237 1 ) ; human cDNA; represents 
the human cDNA sequence of Bp 1218-3690 
clone JFC405 (DSM12372); human cDNA; represents 
the human cDNA sequence of Bp 1-1891 

clone JFC601 (DSM12373) ; murine cDNA; represents 
the murine cDNA sequence of Bp 225-3026 
clone JFC950 (DSM12374 ) ; human genomic clone; 
represents human genomic sequence 
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clone JFC955 (DSM12375); human genomic clone; 
represents human genomic sequence; comprises start 
of the cDNA sequence 

clone JFC N2112 (DSM12376) ; human genomic clone; 
was fully sequenced. The sequence is shown in 
figure 2 and contains the sequence of Bp 1756-4228 
of the human cDNA sequence. 

The following clone was deposited with DSMZ according to the 
Budapest treaty on February 2, 1999: 

clone JFC-BN27 (DSM 12659); contains the sequence 
of Bp 4370-8690 of the human cDNA sequence. 

The following clone was deposited with the DSMZ according to 
the Budapest treaty on February 19, 1999: 

clone JFC-BN20 (DSM 12698); contains the sequence 
of Bp 2025-6280 of the human cDNA sequence 

The following clone was deposited with the DSMZ according to 
the Budapest . treaty on February 1, 2000. 

cDNA clone pL70 (DSM13270); represents essential 
parts of the gene T3 . 

The sequences shown in figures 2 to 8 originates from clones 
JFC955 (DSM 12375) and JFC950 (DSM 12374). The sequence 
shown in figure 1 originates from clones JFC277 (DSM 12371) , 
JFC405 (DSM 12372) and JFC-BN27 (DSM 12659) and JFC-BN20 
(DSM 12698) . The sequence shown in figure 9 originates from 
the clone JFC610 (DSM12373) . 
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The invention is further described by means of the following 
embodiment . 



EXAMPLES 



As to the methods employed reference is also made to 
Sambrook, J., Fritsch, E.F. and Maniatis, T. (Molecular 
Cloning; A Laboratory Manual; second edition; Cold Spring 
Harbor Laboratory Press, 1989) and Current Protocols in 
Molecular Biology (John Wiley and Sons, 1994-1998), the 
below techniques, in particular preparation of DNA or RNA or 
Northern blot, being sufficiently known to, and mastered by, 
the person skilled in the art. 



Before it is described in detail how the experiments are 
carried out, the operating strategy is to be explained 
first . 

When screening for genes triggering diseases of the CNS 
(e.g. neurodegenerative diseases, mental retardations, 
tumoral diseases of the CNS) in the mutated state, 23 cDNA 
clones were isolated from a human fetal brain cDNA library 
(Stratagene company, Heidelberg) . A human fetal brain cDNA 
library was used as a starting material, since it was 
assumed that genes which play a role in the development of 
the CNS and in particular of the brain are prsent in a fetal 
brain cDNA library. However, since what is called 
housekeeping genes (genes expressed in most tissues) are 
also expressed in the CNS, it was tested whether the select 
cDNA clones originate from genes having a CNS-specific 
expression. For this purpose, the cDNA pieces ( 'inserts' ) 
contained in the individual cDNA clones were isolated and 
used for hybridization with Northern blots. The employed 
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Northern blots comprised polyA RNA from different human 
tissues (e.g. brain, skeletal muscle, liver and kidney) and 
various development stages (fetal and adult tissues) . Since 
as mentioned above not only brain-specific genes are 
expressed in the brain, the hybridization with the Northern 
blots was used to identify cDNA clones which are expressed 
above all in the brain and not so much in other tissues. Due 
to this differential analysis it was possible to identify a 
cDNA clone which has a brain-specific expression pattern. 
Using this cDNA clone, the entire mRNA sequence for the new 
protein encoded therein could be isolated and deciphered 
(gene T with protein TP encoded therein) by repeated 
hybridization of the fetal cDNA library. 



EXAMPLE 1 : Identification of the T genes 
1. Titration of the cDNA libraries 

In order to ensure an effective infection, it was initially 
necessary to produce phage-competent bacteria in an 
overnight culture. The magnesium ions contained in the 
medium induce the maltose receptor of the bacteria to which 
the phage binds to infect the bacterium. 



Performance : 

Charge 50 ytl E. coli XLl-Blue in 50 ml LB broth, the medium 
being admixed with MgS0 4 in a concentration of 10 mM. 
Incubate overnight at 30°C and 220 rpm. Centrifuge off the 
bacteria at 4°C and 1000 xg for 10 min. Resuspend in 25 ml 
10 mM MgS0 4 . The thus produced phage-component bacteria 
could be stored at 4°C for up to one week. 



2. Culturing the cDNA libraries 
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For culturing the library, Baltimore Biological Lab. (BBL) 
agar plates and BBL top agarose had to be prepared. The 
phages (human or murine cDNA library, Stratagene company) 
were mixed with SM medium to a dilution of 1:10 3 and 1:10 4 to 
obtain individual plaques after the culturing. 

Performance : 

For the BBL agar (pH 7.2) 10 g BBL trypticase, 5 g NaCl and 
10 g Select agar were weighed and filled to 1 1 with H 2 0. 
The agar is dissolved by autoclaving. After cooling to about 
60° pour the plates. The plates are preheated to 37 °C prior 
to their use to avoid premature solidification of the top 
agarose. The BBL top agarose (pH 7.2) was prepared with 10 g 
BBL trypticase, 5 g NaCl, 6.5 g agarose and 10 ml 1 M MgS0 4 
solution. Dissolve by autoclaving and provide in the water 
bath to 41°C. Add 15 ul of the above indicated dilute phage 
solution and 250 ul of the competent XL-1 bacteria in a 15 
ml Falcon tube. Incubate at room temperature for 20 minutes. 
Add 10 ml BBL top agarose, swivel and place on the heated 
agar plate. The top agarose layer is solid after about 20 
minutes and the plates can be stacked with the agar side up. 
Incubation is carried out overnight at 37 °C. The plates can 
be stored at 4°C after expired incubation time or can be 
used directly for transferring the phage plaques. Carefully 
close the plates for storing them together with a 
chloroform-soaked cloth in plastic bags. The chloroform 
prevents the growth of cryophilic bacteria and fungi. 

3. In vivo excision 

The employed cDNA libraries (human and murine fetal brain 
cDNA library; Stratagene company, Heidelberg) were cloned in 
the vector X-ZAPII. Due to this there was the possibility of 
circumventing the subcloning of the phage insert in a 
plasmid vector. This protocol permits to transfer cDNA which 
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is located as insert in the A.-ZAPII vector into an insert in 
simple way by an in vivo preparation which is now found in 
the plasmid Bluescript SK(-). In principle, this preparation 
serves for introducing by a helper phage information for 
proteins which permit DNA amplification only in the region 
of the phage genome, which have the genetic information for 
the plasmid with cDNA insert. For the most part, the method 
was carried out in accordance with the protocol of the 
manufacturer (Stratagene) . 

In particular, culturing was made such that individual phage 
plaques were on the plate. Then, the in vivo excision 
protocol was carried out with these individual plaques. The 
plasmid DNA and its plasmid inserts were isolated from the 
bacterial clones and subsequently hybridized with Northern 
blots. The selection of further clones to be studied was 
based on the expression pattern in the Northern blots. 

Performance: 

Mix 100 pi of a single phage X--ZAPII clone with 200 pi XL1 
bacteria and 2 pi helper phages (contained in the Stratagene 
kit) . Shake for 15 min. at 37°C and 80 rpm, the specific 
attachment of both phage types to the host bacterium taking 
place. Add 3 ml LB broth. Incubate for 2 h at 37 °C and 200 
rpm. The DNA replication of the plasmid contained in the X- 
ZAPII, its circularization and the packing into coat 
proteins take place and discharge from the bacterium occur 
during this time. Heat to 70°C for 20 minutes. Thereafter, 
centrifuge at 4000 g for 15 minutes. This kills the still 
remaining bacteria and separates their fragments from the 
plasmids existing in the phage coat, which are found in the 
supernatant. Add 1 pi thereof to 200 pi SOLR host cells, 
incubate at 37 °C for 15 minutes. Plate 100 pi onto LB/Amp 
plates. Store at 37 °C overnight. The then grown bacterial 
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clones contain the plasmid with the corresponding cDNA 
insert. A mini-prep DNA preparation was carried out each. 

4. "random primed" DNA labeling 

The radioactive labeling of the double-stranded insert DNA 
of the cDNA clone was carried out as follows for the further 
isolation of overlapping cDNA clones: 

Performance : 

Dissolve 100 ng DNA in a volume of 12 ul H 2 0 for a typical 
labeling batch. 10-minute heating to 95°C effects the 
denaturation of the DNA into single strands. Store the 
preparation on ice to prevent reassociat ion of the two 
complementary DNA strands. Complete the reaction batch by 4 
ul OLG (oligo-labelling buffer), 1 ul Klenow (1U) and 2.5 ul 
a- 32 P-dCTP and 2.5 ul a- 32 P-dATP. Incubate at room 
temperature overnight. Based on the hexanucleotides attached 
to a single strand, the formation of the complementary 
strand takes place during this time by the Klenow fragment 
of the E. coli DNA polymerase I. The DNA is labeled 
radioactively by incorporating a- 32 P-dCTP and the a- 32 P-dATP. 

5 . Separation of non-incorporated radioactive nucleotides 

The non-incorporated nucleotides were separated by means of 
a personally prepared sephadex G-50 column. The separation 
principle of the column is based on the exclusion 
chromatography. The smaller non-incorporated nucleotides fit 
into small pores of the column material while the DNA is 
locked out. The volume in which the nucleotides may move is 
thus greater than the volume available to the DNA. If a 
mixture of DNA and nucleotides is placed on the column, the 
DNA runs through the column faster than the nucleotides. 
This permits the separation of non-incorporated nucleotides. 
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Performance : 

A Pasteur pipette was closed with a small glass bead. Fill 
the Pasteur pipette with sephadex G-50 ("fine") dissolved in 
water until the filling material is 5 cm below the top edge 
of the Pasteur pipette. Rinse the column 2 times with TE. 
Apply the above radioactive labeling batch. Add 320 pi TE. 
Discard the solution which has run through the column. Place 
an Eppendorf tube below the column. Add 350 pi TE. Collect 
the radioactive solution run through the column. 

6. Plaque "blot" 

The plaque "blot" was made to analyze the cDNA library to 
make accessible the cDNA in the phage clones to 
-hybridization . 

Performance : 

Place a labeled hybond-N membrane provided with an 
inscription in air bubble-free manner on the plate with the 
phage plaques for one minute. The labeling pattern was 
transferred. Place it on a Whatman paper soaked with 
denaturing solution (0.5 M NaOH; 1.5 M NaCl) for 10 minutes. 
Neutralize in 50 mM phosphate buffer for 10 minutes. The 
rests of the bacterial layer are wiped off with slight 
pressure using a phosphate buffer-soaked Kleenex cloth. The 
filters are spread at room temperature for drying. 
Thereafter, the filters were baked at 90°C for 1 h. 

7 . Hybridization 

The hybridization is based on the binding of complementary, 
single-stranded nucleic acids. For this purpose, the DNA to 
be studied was immobilized on a membrane and hybridized with 
a radioactively labeled probe. The complementary binding is 
maintained even after washing off the non-specif ically 
adhering probes and can be made visible by means of 
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autoradiography. Single-stranded molecules were incubated 
during the hybridization under salt and temperature 
conditions which support the formation of base-paired double 
strands. A decisive factor in the association and 
dissociation kinetics are the hydrogen bridge bonds between 
the base pairs G-C and A-T . The hybridization reaction is 
influenced by changing the temperature and the salt and 
sample concentrations . 

Performance : 

First, prehybridize the DNA filters in hybridization 
solution (0.5 M NaPi (pH 7.2); 7 % SDS; 0.2 % BSA; 0.2 % 
PETG 6000; 0.05 % polyvinyl pyrrolidone 360000; 0.05 % 
Ficoll 70000; 0.5 % dextrane sulfate) with a 0.1 ml/cm 2 at 
65°C. For this purpose, incubate the filters in a plastics 
box in a shaking water bath at 65°C for a period of at least 
1 h. Discard the prehybridization solution. Place the 
radioactively labeled sample (see above items 4. and 5.) 
with 0.5 ml/cm 2 of hybridization solution (65°C) on the 
filters. The activity of the sample should not drop below 50 
cpm, measured at a distance of 40 cm. The hybridization 
takes place overnight at 65°C (human cDNA library) or 55°C 

(interspecies hybridizations man-mouse and for isolating the 
homologous genes) . Wash the filters two times for 30 minutes 
with about 500 ml wash buffer in a shaking bath at 65°C 

(55°C) . Autoradiography was then carried out. 

8 . Autoradiography 

The filters were packed in plastic foodwrap. The 
autoradiography was made at -80°C in an X-ray cassette 
containing a reinforcing film made of calcium tungstate. The 
exposure is 30 minutes to several days, depending on the 
strength of the signal. 




The complete mRNA which codes for the protein of the T gene 
could be isolated by means of the above mentioned 
techniques. Furthermore, using cDNA clones of this newly 
isolated T gene it was possible to isolate two further genes 
(T2 and T3) which have distinct homologies with this gene. 
For this purpose, the above mentioned techniques were used 
again. For isolating the related genes T2 and T3, the 
hybridization temperature was lowered to 55°C. 



EXAMPLE 2 : Northern blot 



The 'multiple tissue Northern blots 1 were purchased from the 
CLONTECH company (Palo Alto, California, U.S.A.) and used in 
accordance with the instructions from the manufacturer. The 
respective DNA samples of the genes T, T2 and T3 were 
labeled radioactively and hybridized with the Northern 
blots. The sequence of bp 1-4200 of figure 1 was used for 
the analysis of the expression pattern on a Northern blot 
level. For the gene T3 the sequence of bp 1310-4870 of 
figure 17 was used for hybridization. The sequence of bp 
3120-4230 of figure 16 was used for the gene T2 . The "random 
priming" method was used for the radioactive labeling of 
double-stranded DNA. 



a) Random priming: 

Dissolve 100 ng DNA in a volume of 12 pi for a typical 
labeling batch. 10-minute heating to 95°C effects the 
denaturation of the DNA into single strands. Store the batch 
on ice to prevent reassociat ion of the two complementary DNA 
strands. Complete the reaction batch by 4 pi OLB, 1 pi 
Klenow (1U) and 2.5 pi a- 32 P-dCTP and 2.5 pi a- 32 P-dATP. 
Incubate at room temperature overnight . Based on the 
hexanucleotides attached to a single strand, the formation 
of the complementary strands takes place during this time by 
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the Klenow fragment of the E. coli DN polymerase I. The DNA 
is labeled radioact ively by the incorporation of the a- 32 P- 
dCTP and the a- 32 P-dATP. 

The non-incorporated nucleotides were separated by means of 
a personally prepared sephadex G-50 column. The separation 
principle of the column is based on the exclusion 
chromatography. The smaller non-incorporated nucleotides fit 
into small pores of the column material while the DNA is 
locked out. The volume in which the nucleotide may move is 
thus greater than the volume available to the DNA. If a 
mixture of DNA and nucleotides is placed on the column, the 
DNA runs through the column faster than the nucleotides. 
This permits the separation of non-incorporated nucleotides. 
For this purpose, a Pasteur pipette is closed with a small 
glass bead. Fill the Pasteur pipette with sephadex G-50 
("fine") dissolved in water until the filling material is 5 
cm below the top edge of the Pasteur pipette. Rinse the 
column 2 times with TE . Apply the above radioactive labeling 
batch. Add 320 ul TE. Discard the solution which has run 
through the column. Place Eppendorf tube below the column. 
Add 350 ul TE. Collect the radioactive solution run through 
the column. 

b) Hybridization: 

The Northern blots were hybridized as described below. 
First, the Northern blots were prehybridized at 65°C in 10 
ml hybridization solution (350 ml 20 % SDS, 500 ml 1 M 
phosphate buffer, pH 7.2; 150 ml distilled water). For this 
purpose, the Northern blots were incubated in a glass tube 
in a hybridization roll-over-type furnace at 65°C for a 
period of 6 h. 
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The prehybridization solution was discarded. The 
radioactively labeled sample was placed with 10 ml 
hybridization solution (65°C) on the filters. 

The hybridization was carried out at 65°C overnight. The 
filters were then washed two times for 30 min. with about 
500 ml wash buffer (80 ml 1 M phosphate buffer, pH 7.2; 100 
ml 20 % SDS, 1820 ml distilled water) at 65°C in a shaking 
bath. 

c) Autoradiography 

The filters were welded into plastic film. The 
autoradiography was made at -80 °C in an X-ray cassette which 
contained a reinforcing film of calcium tungstate. Exposure 
was 1 to 4 days depending on the strength of the signal. 

The results of the Northern blots carried out are shown in 
figure 23. 

EXAMPLE 3: RNA in situ hybridization 

Embryos in various development stages were isolated from 
pregnant NMRI mice. The embryos and other tissue samples 
were fixed overnight with 4 % paraformaldehyde in PBS at 
4°C. 10 \xm freezing sections of the embryos were transferred 
to slides coated with 3-aminopropyl triethoxysilane . Sense 
strand ("sense") and antisense strand ( "antisense" ) samples 
were produced by transcription with a- 35 S-UTP with a 
specific activity of >10 9 decays per minute/yg. For this 
purpose, the linearized mouse T gene cDNA clone from figure 
9 was transcribed with T7 or Sp6-RNA polymerase. The sample 
length was reduced by alkaline lysis to 150 to 200 
nucleotides. The slides were prehybridized at 54°C in a 
solution containing 50 % formamide, 10 % dextrane sulfate, 
0.3 M NaCl, 10 mM Tris, 10 mM sodium phosphate, pH 6.8, 20 
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mM dithiothreitol, 0.2 % Denhardt's solution, 0.1 Triton X- 
100, 0.1 mg/ml Escherichia coli RNA and 0.1 mM non- 
radioactive a-S-UTP. The 35 S-labeled sample (8 x 10 4 decays 
per minute per ml) were added to the hybridizing mixture for 
the hybridization and the hybridization was then continued 
for 16 h at 54 °C in a humid chamber. The slides were then 
washed in the hybridization solution for 2 hours. The 
remaining non-hybridized RNA sample was then digested using 
RNase A. Thereafter, the slides were washed for 30 minutes 
at 37°C with 2x SSC, 0.1 % SDS and for 30 minutes with O.lx 
SSC, 0.1 % SDS. Then, the slides were dehydrated with 
increasing ethanol concentrations. The slides were covered 
with Ilford K5 autoradiography emulsion. After 1 to 2 weeks 
of exposure at 4°C, the slides were incubated in Kodak D19b 
developer and dyed with Giemsa. The sections were analyzed 
in dark field and bright field illumination with a Zeiss SV8 
stereomicroscope and an Axiophot microscope and photographed 
with an Agfa ortho black-and-white film. 

The results of the RNA in situ hybridization are shown in 
figures 25, 26, 27 and 28. 

Figure 25: expression of the murine T gene during the mouse 
embryogenesis. Bright field (a,c,e,g) and dark field 
pictures (b, d, f, h) of horizontal (a,b) and sagittal 
sections (c-h) through a 10.5 (a,b), 12.5 (c,d), 14.5 (e,f) 
and 16.5 (g,h) dpc embryo (dpc = days post conceptionem) 
which were hybridized with an antisense ribo sample of the 
murine T gene. Dec = decidua, g - guts, he = heart, lab = 
labyrinth, li = liver, me = myelcephalon, sc = spinal cord, 
sga - spinal ganglia, sb - tooth bud, te - telencephalon. 
Bar = 1 mm. 
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Figure 26: Expression of the murine T gene in the 

postnatal brain. Bright field (a,d) and dark field pictures 
(b,c,e,f) of horizontal sections through an 1 wpn (weeks 
post natalls) and 6 wpn head, which were hybridized with a T 
gene antisense (b,e) and a sense sample (c, f ) . cd = 
cerebellum, cor = cortex, cos = colliculus, ey = eye, hi = 
hippocampus, ne = nasal epithelium, ob = olfactory bulb, bar 
- 1 mm. 

Figure 27: Greater enlargement of the 10.5 dcp embryo of 
figure 25 a,b. The arrows point to a region of little 
expression in the somites (arrows in b) , An intense 
expression can be seen in the region between mesencephalon 
and telencephalon ( " f orebrain-midbrain junction"). Aod = 
aorta dorsalis, me = mesencephalon, sc = spinal cord, te = 
telencephalon. Bar = 100 urn. 

Figure 28: Expression of the T gene during the development 
of the nervous system. Expression of the T gene in neurons 
of the mantle zone of the developing brain and in nuclei of 
peripheral nerves (arrow in b) . No expression is visible in 
proliferating neurons in the subventricular layer or in 
migrating neurons of the intermediate zone (c,d). On day 
16.5, an intense expression is visible in differentiating 
neurons of the mantle zone of the telencephalon (e,d). A 
minor expression is also visible in neurons of the spinal 
cord and the spinal ganglia (g,h). Furthermore, a minor 
expression is visible in an individual layer below the skin 
(g,h). iz = intermediate zone, mz = mantle zone, sc = spinal 
cord, sga ™ spinal ganglia, sk = skin, svl = subventricular 
layer, vn = ventricle. Bar = 100 pm. 



EXAMPLE 4 : Production of antibodies 



O *3 9 ::L H-Ei n 4-*9 «, O i& O 115 O E!! 



53 



Using a synthetically produced peptide of the sequence 
"EKGEDPETRRMRTVKNIAD" animals are immunized to produce 
antibodies against the T protein as follows: 

Immunization protocol for polyclonal antibodies in rabbits 
600 ug purified KLH-linked peptide in 0.7 ml PBS and 0.7 
complete or incomplete Freund's adjuvant are used per 
immunization : 

Day 0: 1 st immunization (complete Freund's adjuvant) 
Day 14: 2 nd immunization (incomplete Freund's adjuvant; 
icFA) 

Day 28: 3 rd immunization (icFA) 
Day 56: 4 th immunization (icFA) 
Day 80: bleeding to death. 

The rabbit serum is tested in an immunoblot. For this 
purpose, the protein used for the immunization is subjected 
to SDS polyacrylamide gel electrophoresis and transferred to 
a nitrocellulose filter (cf. Khyse-Andersen, J., J. Biochem. 
Biophys. Meth. 10 (1984), 203-209). The Western blot 
analysis was carried out as described in Bock, C.-T. et al., 
Virus Genes 8, (1994), 215-229. For this purpose, the 
nitrocellulose filter is incubated with a first antibody at 
37 °C for one hour. This antibody is the rabbit serum 
(1:10000 in PBS). After several wash steps using PBS, the 
nitrocellulose filter is incubated with a second antibody. 
This antibody is an alkaline phosphatase-coupled monoclonal 
goat anti-rabbit IgG antibody (Dianova company) (1:5000) in 
PBS. 30 minutes of incubation at 37°C are followed by 
several wash steps using PBS and subsequently by the 
alkaline phosphatase detection reaction with developer 
solution (36 uM 5 ' -bromo-4-chloro-3-indolylphosphate, 400 uM 
nitro blue tetrazolium, 100 mM Tris-HCl, pH 9.5, 100 mM 
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NaCl, 5 mM MgCl 2 ) at room temperature until bands become 
visible . 

It shows that polyclonal antibodies according to the 
invention can be prepared. 

Immunization protocol for polyclonal antibodies in chickens 

100 ]ig of purified KLH-linked peptide in 0.8 ml PBS and 0.8 
ml of complete or incomplete Freund's adjuvant are used per 
immunization . 

Day 0: 1 st immunization (complete Freund's adjuvant) 
Day 28: 2 nd immunization (incomplete Freund's adjuvant; 
icFA) 

Day 50: 3 rd immunization (icFA) 

Antibodies are extracted from egg yolk and tested in a 
Western blot. Polyclonal antibodies according to the 
invention are detected. 

Immunization protocol for monoclonal antibodies in mice 

250 lag of purified KLH-coupled peptide in 0.25 ml PBS and 
0.25 ml of complete or incomplete Freund's adjuvant are used 
per immunization. The peptide is dissolved in 0.5 ml 
(without adjuvant) in the 4 th immunization. 

Day 0: 1 st immunization (complete Freund's adjuvant) 

Day 28: 2 nd immunization (incomplete Freund's adjuvant; 



icFA) 



Day 5 6 



3 



immunization (icFA) 



Day 8 4 



th 



immunization (PBS) 



Day 87 



fusion . 



55 



Supernatants of hybridomas are tested in a Western blot. 
Monoclonal antibodies according to the invention are 
identified. 

EXAMPLE 5 : immunohistochemical studies 

The immunohistochemical studies shown in figure 24 were made 
with an affinity-purified polyclonal rabbit antibody, 
produced above, against the T protein (referred to as first 
antibody below) . Mouse brain was removed and treated as 
follows : 

1 st day 

section thickness 6-10 ym, common fixation on slides, 
storage at -80 °C for up to about 2 months 

Take out the sections the evening before and allow them to 
dry at room temperature overnight 

Rinse slides in PBS, pour off, rinse once again, thereafter 
allow to stand in PBS for 10 min. 

Take out slides and wipe off the liquid around the tissue 
using a cloth . 

Encircle using PAP-PEN (protein-glycerol ; Dako company) so 
that no more liquid can flow out. 

Add 100 pi peroxidase blocking solution (Dako company, 
Hamburg), incubate for 20 minutes. 

Rinse slides in PBS, pour off, rinse again, thereafter allow 
to stand in PBS for 10 min. 

Take out slides and wipe off the liquid around the tissue 
using a cloth. 

Prepare an 1:10 dilution of normal (sheep) serum in PBS 
(e.g. sheep Dako X0503, Dako company, Hamburg), add 100 yl 
thereof and incubate for 20 minutes. 
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Rinse slides in PBS, pour off, rinse again, thereafter allow 
to stand in PBS for 10 minutes. 

Take out slide and wipe off the liquid around the tissue 
using a cloth. 

Add first antibody in a dilution of 1:100. 

Add 100 ul of the first antibody (in PBS) and incubate in a 
refrigerator in a humid chamber overnight. Control: without 
first antibody. 

2 nd day 

Take humid chamber out of the refrigerator and allow to 
stand at room temperature. Rinse slide in PBS, pour off, 
rinse again, thereafter allow to stand in PBS for 10 
minutes, when many slides are analyzed wash two times with 
PBS. 

Take out slides and wipe off the liquid around the tissue 
using a cloth. 

Prepare a 1:100 dilution of second antibody "antirabbit 
biotinylated" (Amersham company, Braunschweig) in PBS and 
add 100 ul thereof. 

Incubate in a humid chamber at room temperature for 4 5 
minutes . 

Rinse slides in PBS, pour off, rinse again, thereafter allow 
to stand in PBS for 10 minutes. 

Take out slide and wipe off the liquid around the tissue 
using a cloth. 

Prepare a 1:100 dilution of streptavidine peroxidase 
(streptavidine horseradish) (Amersham company, Braunschweig) 
with PBS and add 100 \il thereof. 

Incubate in a humid chamber at room temperature for 45 
minutes . 
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Rinse slides in PBS, pour off, rinse again, thereafter allow 
to stand in PBS for 10 minutes. 

Take out slides and wipe off the liquid around the tissue 
using a cloth. 

Staining : Add one drop chromogen per ml buffer just before 
the use. Vortex and place in the dark. 

Add 100 ]al staining solution (Dako company, Hamburg) . 
Finally, stain the control. Incubate for about 2 minutes. 
Incubate slides in water. Inspect under a microscope. 

Place 1-2 drops of crystal Mount on the section. If there is 
an air bubble, suck it off with a paper handkerchief. 
The rest of the slide is wiped doff using HCl-EtOH to remove 
the stain. 

Place a line of adhesive (Eukitt) on the cover glass. Press 
the cover glass onto the slide without producing air 
bubbles. 

The enzyme in the second antibody results in a dye formation 
(DAB) so that the T protein can be detected. 

Figure 24 (a-d) : Light-microscopic pictures which show that 
the T protein is localized in or at the nucleus of the cell. 
The electron-microscopic picture in e shows that the T 
protein is not localized in the nucleus but in the membrane. 
The pictures are highly consistent with a function as a 
membrane-terminal nuclear pore protein. The arrows in e show 
the stain formed which can be seen on the cytoplasmic side 
of the nuclear membrane. 
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Claims 

1. DNA sequence coding for a protein which is involved in 
the development of the nervous system, in particular 
the CNS, and is expressed in a tissue-specific and 
development-specific manner, wherein the DNA sequence 
comprises the following DNA sequences: 

(a) the DNA sequence of figure 1, figure 2, figure 3, 
figure 4, figure 5, figure 6, figure 7 or figure 
8; 

(b) the DNA sequence of figure 9 or figure 10; 

(c) the DNA sequence of figure 11; 

(d) the DNA sequence of figure 12 or figure 13; 

(e) the DNA sequence of figure 14 or figure 15; 

(f) the DNA sequence of figure 16; 

(g) the DNA sequence of figure 17 or 18; 

(h) the DNA sequence of figure 19; 

(i) a DNA sequence hybridizing with (a)..., (b) , (c) , 
(d), (e), (f), (g) or (h) ; 

(j) fragments, variants, functional equivalents, 
derivatives or precursors of the DNA sequence of 
(a), (b), (c), (d), (e), (f), (g), (h) or (i); or 

(k) a DNA sequence which differs from the DNA sequence 
of (a), (b), (c), (d), (e), (f), (g), (h) , (i) or 
(j) due to the degeneration of the genetic code. 

2. The DNA sequence according to claim 1, which codes for 
a protein or peptide comprising the amino acid sequence 
of figure 1, figure 9, figure 11, figure 12, figure 13, 
figure 14, figure 15, figure 16, figure 17, figure 18 
or figure 19, wherein the protein or peptide has the 
biological activity defined in claim 1. 
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3. An antisense RNA, characterized in that it is 
complementary to the DNA sequence of claim 1 or 2 and 
can reduce or inhibit the synthesis of the protein 
encoded by this DNA sequence. 

4. Ribozyme, characterized in that it is complementary to 
the DNA sequence of claim 1 or 2 and can bind 
specifically to the RNA transcribed by this DNA 
sequence and can cleave it so as to reduce or inhibit 
the synthesis of the protein encoded by this DNA 
sequence . 

5. Expression vector, containing the DNA sequence 
according to claim 1 or 2 or coding for the antisense 
RNA according to claim 3 or the ribozyme according to 
claim 4 . 

6. The expression vector according to claim 5, which 
comprises additionally the promoter of the human T gene 
or an ortholog of the T gene. 

7. Expression vector according to claim 5 or 6, which 
codes for the T, T2 or T3 proteins or for fragments 
thereof in the form of a reporter fusion protein. 

8. Host cell which is transformed with the expression 
vector according to any of claims 5 to 7 . 

9. Protein which is encoded by the DNA sequence according 
to claim 1 or 2 and which is involved in the 
development of the nervous system and is expressed in 
tissue-specific and development-specific manner, or 
fusion proteins, fragments, variants, derivatives or 
precursors of the protein. 
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10. Protein according to claim 9, which has one of the 
following motives: 

Motive 1: 

(A,T) (I,P,V) (L,T) (G,A,Q) ( L , V) XXX (L, V) 

Motive 2: 
IYTDQWAN 
Motive 3: 

AXXXXXXXXXGXXXXXXAXXXXXXXXXXXXXXXXXXQ 
Motive 4: 

SXXXXDX (12, 20) KX (17, 22) AXXXXXXXXL 
Motive 5: 
IYTDWANXXLX (K, R) 
Motive 6: 

KX ( 1 8 , 2 1 ) AXXXXXXXXLX ( 1 5 , 2 4 ) S 
Motive 7: 

NX ( 3 , 11) SXXXAXXXXXXXL 

wherein X = every amino acid 

(A, T) = amino acid A or T at this site 
X (number 1, number 2) = number 1 to number 2 
Xs at this site 

11. Method of producing the protein according to claim 9, 
which comprises culturing the host cell according to 
claim 8 under suitable conditions and obtaining the 
protein from the cell or the culture medium. 

12. Antibody which is directed against the protein 
according to claim 9 or fragment thereof. 

13. Antibody according to claim 12, which is obtained by 
immunizing animals with a peptide having the sequence 
" EKGEDPETRRMRTVKNIAD 11 . 
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14. Use of the DNA sequence according to claim 1 or 2, the 
antisense RNA according to claim 3, the ribozyme 
according to claim 4, the expression vector according 
to any of claims 5 to 7, the protein according to claim 
9 or the antibody or the fragment thereof according to 
claim 12 or 13 for preventing or treating diseases of 
the nervous system, in particular of the CNS. 

15. Use according to claim 14, wherein the disease of the 
nervous system is a tumoral disease of the CNS. 

16. Use according to claim 14, wherein the treatment of 
diseases of the nervous system are the promotion of the 
neuronal regeneration in the case of injuries of the 
nervous system and degenerative diseases of the nervous 
system. 

17. Use according to claim 14, wherein the treatment of 
diseases of the nervous system are the regeneration of 
the neuronal linkages and the regeneration of the 
innate and acquired malfunctions of the nervous system. 

18. Use according to claim 15 for inhibiting the growth and 
spreading of tumor cells. 

19. Diagnostic method for detecting a disturbed expression 
of the protein according to claim 9 or for detecting a 
changed form of this protein, in which a sample is 
contacted with the DNA sequence according to claim 1 or 
2 or the antibody or the fragment thereof according to 
claim 12 or 13 and then it is determined directly or 
indirectly whether the concentration of the protein 
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and/or its amino acid sequence differs from a protein 
obtained from a healthy patient. 

20. Diagnostic kit for carrying out the method according to 
claim 19, which contains the DNA sequence according to 
claim 1 or 2 and/or the antibody or the fragment 
thereof according to claim 12 or 13. 

21. Non-human mammal whose naturally occurring T, T2 or T3 
gene comprises a change in the gene structure or the 
gene sequence. 

22. Non-human mammal, wherein a change of the gene 
structure of the T, T2 or G3 gene is achieved in the. 
mammal by introducing a deletion in place of which a 
homologous or heterologous sequence is introduced. 

23. Non-human mammal, wherein a change of the gene 
structure of the T, T2 or G3 gene is achieved by 
inserting a homologous or heterologous sequence in the 
corresponding gene naturally occurring in the mammal. 

24. Non-human mammal according to claim 22 or 23, wherein 
the heterologous sequence is the selection marker 
sequence . 

25. Non-human mammal according to claim 24, wherein the 
selection marker sequence conveys resistance to 
neomycin. 

26. A method of producing a non-human mammal according to 
any of claims 21 to 25, characterized by the steps of: 
(a) producing a DNA fragment, in particular a vector, 

containing a changed T, T2 or G3 gene, the T, T2 
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or T3 gene having been modified by inserting a 
heterologous sequence, in particular a selectable 
marker; 

(b) preparing embryonal stem cells from a non-human 
mammal (preferably a mouse) ; 

(c) transforming the embryonal stem cells from step 
(b) with the DNA fragment from step (a) , the T 
gene in the embryonal stem cells being changed by 
homologous recombination with the DNA fragment 
from (a) 

(d) culturing the cells from step (c) , 

(e) selecting the cultured cells from step (d) for the 
presence of the heterologous sequence, in 
particular the selectable marker, 

(f) producing chimeric non-human mammals from the 
cells of step (e) by injecting these cells into 
mammalian blastocysts (preferably mouse 
blastocysts), transferring the blastocysts to 
pseudo-pregnant female mammals (preferably mouse) 
and analyzing the resulting offspring for a change 
of the T, T2 or T3 gene. 

27. Transgenic cell or tissue which is capable of 
expressing a T protein or part of the T protein or an 
ortholog thereof. 

28. Use of the non-human mammal according to any of claims 
21 to 25 or the transgenic cell or the transgenic 
tissue according to claim 27 for the analysis of the 
function of the T gene family. 

29. Use of the non-human mammal according to any of claims 
21 to 25 or the transgenic cell or the transgenic 
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tissue according to claim 27 for identifying inhibitors 
and enhancers of the T gene family. 

30. Vertebrate gene and functional equivalent, derivative 
or a bioprecursor thereof, which code for a protein 
having a statistically significant amino acid sequence 
homology to the T gene, T2 gene or T3 gene according to 
any of the following figures: figure 1, figure 9, 
figure 11, figure 12, figure 13, figure 14, figure 15, 
figure 16, figure 17, figure 18 or figure 19. 

31. T gene and its vertebrate orthologs and vertebrate 
paralogs which code for a nuclear pore protein. 

32. Vertebrate protein which has an amino acid sequence 
according to figure 1 or an amino acid sequence which 
differs from the amino acid sequence in figure 1 by one 
or more amino acids. 

33. Vertebrate T, T2 or T3 gene and the protein encoded 
therein in all of its naturally occurring allelic and 
mutated forms. 

34. Medicament containing a protein according to claim 9 or 
a functional equivalent, a fragment or a bioprecursor 
thereof in combination with a pharmaceutically 
acceptable carrier . 

35. The method of identifying substances which has an 
enhancing or inhibiting influence on the effect of T 
protein, T2 protein or T3 protein, by means of 

determining the bi-directional transport through 
the nuclear pores, 
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determining the binding to filaments of the cell 
(e.g. actin filaments and microtubuili ) or 
determining the increased or reduced transcription 
of cellular or reporter genes. 



36. Method of identifying substances which have an 
enhancing or inhibiting influence on the effect of 
proteins which are functionally linked to the T protein 
in direct or indirect way, or represent parallel signal 
or functional pathways, by means of 

determining the bi-directional transport through 

the nuclear pores, 

determining the phosphorylation and the 
dephosphorylation of proteins, 

determining the binding of the T protein to 
filaments of the cell (e.g. actin filaments and 
microtubuli ) , or 

determining the increased or reduced transcription 
of cellular or reporter genes. 



37. The method according to claim 35 or 36, wherein the 
modified transcription with reporter molecules, 
preferably the occurrence of certain mRNAs or the EGFP 
protein, is detected. 



38. The method of identifying further proteins which play a 
role in the development and function of the nervous 
system and/or are a nuclear pore protein, wherein the 
method comprises the steps of: 

(a) producing an antibody against a protein according 
to claim 9, 

(b) contacting a cell extract with the antibody and 
identifying the antibody/protein complex, 
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(c) analyzing the complex to identify a protein which 
has bound to the protein of the complex and is no 
antibody, and 

(d) optionally repeating steps (a) to (c) to identify 
further proteins of this function. 
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Abstract of the Disclosure 

The invention relates to a protein (TP) and to proteins 
related thereto, which are involved in the development of 
the nervous system, especially the central nervous system, 
and are expressed in a tissue-specific and development- 
specific manner as well as to DNA sequences coding for these 
proteins. The invention also relates to antibodies directed 
against these proteins or fragments thereof and to anti- 
sense RNA or ribozymes which are directed against the 
expression of said protein. Finally the invention concerns 
medicaments and diagnostic processes in which the above 
mentioned compounds are used. The invention further relates 
to a non-human mammal whose TP-coding gene is modified. 
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Human genomic sequence 
1 GATCAGACTT TGAAGAGTGT TTGTACCATG CTAAAGTTTA CAGAATTTAT 
51 TCCTGCTCTT TGAGGGTGCA TTGCAAATCC AGGC TAG AGG GAG AG AT AC C 
101 AG TT AG G AXA GTACAGCAAT ACTCTACTGG GAAATGGTGA GGTGTTTCGT 
151 GAAGACAATG G C AAC AC AG A TGAAGACATG CAGATGGAGG AAATAAAGAT 
201 CCAGTTGAGC TTGTTGGC C A GTTGGATAGA GGTTGAGGTT ATGCATGATG 
251 GAGCAATCTA GGTTTTTGTC TTGGGTAGGT GTTTCCATGA TAG T AC TC AG 
3 01 AATGAATCAT ATAGTTGTAC AGGTTGAATC CCACCCATGT TTGCACAATA 
3 51 GAGTGACTGT CTAGC TG AAA TCCAGATGAC ACTCTGTATG CTAAGCTATG 
401 CTTCATGGAA CTGTATAAAG GCACTTGCTA C ATAGGC TAG TGGCAGATCT 
451 GGAAGTAACC TATATGGTAT ATAGGAAATG AGGTGGCTTT TGTATAAATC 

5 01 CTACAGATAA ATTTCATTTC CTGATCCTAT TATTTTGACT CATGTTAGCC 
551 CAAGAAGAGT ATTCAGTACT TCATATCCCT GAAGGTAAGA C AG AG TAG TA 

6 01 TTAGATTCAC TATTTGGCAA ATAAAAGGGA TCAAGTCCTA AGATCAAGCT 
651 GATGAATCAA CACCTCATAG GATATGTCCC AACCAATTAT ATGGCTTCCC 

7 01 CTATAAATAA AATCTAGTTC TCTTCTCTGG AGAGGAACAG TGAAGAATAT 

7 51 CATAACCTAT GCTACAAACT GCTTGAGTAG GAGCTACTTC TCTCCAAGGC 
801 TTTATATCAT TCATTCTGGC AGGCCCCTCT GTTTGTTCTC ACCAGCTCCT 

8 51 GGGAAATTTA TTTCTC CTCT AG TG AT AT AA AAGCTCTCTG TTTGAGATGA 

9 01 AGGGCTGCCC AG TTT ATC AG ATCTGTATTA GTCTGTTCTC AGGCTGCTAA 
9 51 TAAAGACATA CCTGAGACTG AG T AATTT AT GAAGGAAAGA GGTTTAATTG 

1001 ACTCACAGTT CCACATGGCT GGGGAGGCCT CACAATCATG GCGAAAGACT 
10 51 AATAAGG AG C AAAGTCACAT CTTACATGGC TG C AG AC AAG AGAGCATGTG 
1101 CAGGGGAACT GCTCTCCATA AAACCATCAG ATCTTGTGAG ACTTGTTCAC 
1151 TATTACAAGA ACAACAGACA GGAAAACCCG CCCCCTCAAT TCAATTACCT 
12 01 GCCACTGGGA CCCTCCCACA ACACATGGGG ATTATGAGAG CTACAATTCA 

12 51" AGATGAGATT TGGGTGGGGA TACCGCCAAA CCATATGAAG TTCTTTCTTT 

13 01 GTTACTGGGT ACCATATCCA TTCTGTTGAG GTTCTGAGCC TTTCCAGTTA 

13 51 CTGTAACTCC TCTATCTCCT GTCTGTGCTA AGACTCAGTG ACCTCTCTCT 
1401 GCCTTGCTTC TGCTTTGTCC TGACCCTTTC TGTGCATGCA CTCACTCTAG 

14 51 TTTGCCCACC TGAGGTGAGA GATGGTCCAG ATT AG C AAC A ACAATCTGTG 
1501 GACTAAAATC CTCTTTAGGG AGGAAGCAAA ATTCAGATGG ATGTTACTAA 
1551 ACAAAGCTCA GAAACAGAGA C C AGG G TG TG GGAAGTAAGG TAG TAG C C TG 
1601 AGAGCAGCTG GCAGTGTTTT AGACCTGGAG GGAGGTTAGG TCATCAGCAA 
1651 TSAGGAGACT GCCTGGAAAA TCCTAGAAAA TTAAGACATC TGGTCAGGCA 
17 01 AGGTCATATC ACCAGCACAC TTCCCTTTTC AAGTTGAATC CCTTTCCTCT 

Fig. 2 
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17 51 GTTAAGAGGA TTCAAGTGTC TTTCTTGCAT TTTGTCTTCT CTTCTATATC 
1801 CATGCTTGCA ATATAAGGAG ACAGCAGTTG GCTGTTTGTG C T AG AAAATA 

18 51 TAAATGGCCA TTTTGAAAGC ATGC C AG AC A GGATCTGCGG CAAGTTTTCA 

19 01 ATGTTACTGC TGCCATCTGT TGTTCTTCAG TGCTGGGATG TGAATCTCTT 
19 51 GGCAAACATC TCTCTAATTC TGAACTATCT TTCACCCCCA TC TAG AG AT A 
2 0 01 TTCACTTACT GAAGTGCCTT TTTAAAGCAA TGTTCCTCAC CAAGGCGATG 
2051 TTCTGAATGT TTTAAAATGG AAGAATCTGG AATGTTTTTA TTATAATACA 
2101 TTTTG TAT AT CCCAAAGCAA AAATCAATTT CTTCATGGTT AATACTTTTG 
2151 TAATTTTGTT TTTAATAATA TTTTCCTTTT AAATATAAGA AATATTTTAT 
22 01 TGAATTAATA CTTTAATGTA GCTGTTTCAA GTAAGATAAA ACAGAACAGA 

22 51 TTACTGTTTT CAACCTTGTT C AC AG T TAGC TCTGTAACTA AGTTGTTGAG 

23 01 CTTTATCTAA GCTTTTTTAT TTTTACATAA CGTTTCCCTT TTCACTTAAC 
23 51 C TTGAAATT A TAGTAATTTG GGAACTTCTA TTCCTCTGAA AGAGAAAGCT 
2401 AATGCCAAAG ATATTTCAAG GGAGAAAGAA GGTTTTTAAA AGGAGAGACA 
2451 ATTCAGCTCA GACTTAATAG CTGTGATTGC TATTTATTAA GCAGAACGCC 
2501 T AT AAC T AAA TTCTCAGATA TCCAAAAAAC AGCCTGTACA TTCTCAAAAG 
25 51 TGAAGATTAC AC ATTTTC T A AGTTAAGGTA AAAGTTTTGT CTCTGTAGCA 
2 6 01 TCTTACTGAT TTCTATCTTC TCATTCTGCC TTAATAATGT C AC TAAATAA 
2 651 ATGTTTGATG CACTAATACA TGAATAAAAC TATTCATGGT AATGATTCTT 
27 01 TAGAAACACA GCTAAGTTTT GTAATTTTGT TTTTTAAAAA TTAAAAATTT 
27 51 AAATATAAAA ATGTTTTTAA AAGGCTTGAA TTTCTTGTTA AATG T AC AC A 
2 8 01 TTTTAAGTTG TAGGCTGTCT TTAAAAATAA TCTCTCCACA CACTGTAGTA 

2 8 51 TTTAAAACAT CATGATATTA C TAT AAAAC A TCAACAAATA GGGCAGTGGA 
29 01 AAACATGGTA ATCACTAAAA ATGCTCACAT GTCATATATT AAG AC TTG AT 
29 51 AAGTAAACCA CAATAATAAA TAGAAAAGAA AT AG TTG T C T AAAAAGGGAT 

3 0 01 TCTCACCTTT CAAACCTTAC CATAAAAATG GAATATAAAA GAAGGAAGAG 
3 051 GAGGAGAAAT CAAATTATAT CATAAAATTT TCTGGGCAAA AATATTACAG 
3101 AAGAAAATAA GAAAGATTTA TGGAGTTGAC TGAAACATTT TTGAATCCTA 
3151 TACATAAAAA TATCGTTAAT TAAAAGGAAA AACAAAGAAA CAGATTTGGG 
3 2 01 AAATATTTGA AACTGGTTTT TTTTTAGCAT TTAAAAATGT AATACAAATG 
3 2 51 GATTATTTAA ACTCCATTGC AAAAATACAC AAAGGACATT GACAATGTCT 
3 3 01 GGAAATAAAA TT AG C T AAG T AAGTTATAGA AAAACTCAGT CTCACAATTT 
3 3 51 G AC AAATG T A ACTGAAAACT ATTAATATAA TTAGTAACTA TTTTTACATG 
3 4 01 TCAAAATTTT TGAATTACTA AAGGAAACCA CAATGCCTGA AAGTATCCAG 
3 451 GGTTTTTTTT TTTTTTTATA ATATTG G C AC TGTCATATGG GTGGCAGGAA 

Fig. 2 (cont'd 1) 
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53 01 TTACTTCCAG GTTTTTACAC TTATTTTTAC AATAAATGTG AAATACCCTT 

53 51 TTTGACAATA TCTACAAATA TTTCTTATTT GTCTTTATTG CTCTTTCCTG 

5401 TAATGTTTAG TCTTCATTTT CCTGATAATG GCTATCTAAA GTTATCTCCT 

5451 CAAAGAAGCA GTTATTTATT CACCCAAATC TTCTAGTCCT TCTCTGGAGT 

55 01 TTTCTTCTCA CTTCATTCCC TTGGTTTTTG CCACAATTTG TAATAATTTG 
5551 CAATTTGGAG TGTTAGAATG AGGGAATAAA TCACAGGTAA TG AC TAT AG T 
5 601 TTGTGACTAT GTAAGATTGG ATTCGTTATT GATTTATTCC ACAAACACTG 

56 51 AGGCACTGCA TTT AGC C AAA TGCCAATCTT GGGCAGTGAG ACTCTGAAAG 

57 01 AGAATCTGCT TCCCCCACCA TAAACTACAA AGTGAAACAA CTCAGAATGT 
57 51 ACATAAATTA CAGAATGAAA G C AC AC TAG A AG TAAAC AC A GATGTGGAAG 
5801 AGGTAAAGTG TCCTTGAAAA TCATGGAAAG ATTCATAAAG GGAATGACAT 
5 8 51 TTCAACTGGA TTCTAAACCA GTTATTCAAG CTCCACAAGG TTGCACAGTA 
59 01 AATGAGCAGT GGCAGGATGA CATACCTTAG AAAGTAAAAG GAATCTTTTT 

59 51 TAAAC TG C T A TAAAAATCAT TACATATACA TTTTGTAGGT CGAGAGTAAG 

60 01 GTATTTAACA TAAAATCATT TTAGTATATC AGTGTTTATA TAGACTTAGG 
6051 TTTTTCTCAT TTAAAACCTC TTTTAATGAC TTGTGCTTTT CTTCATGGTA 
6101 ATAAAACATT TTCCCAGGAA GTGCTGAATA AATCTTTCTT GAAATACGTT 
6151 TTATTGCTTT CTATCAATGA CCCTGAAGTA ATACAGAATT TACACTTCAG 
62 01 CGGTTGCAAT GCTCAAACTT GACAGGTAAT GCACTGTGTT TGCTGATATA 

62 51 AGAGGTATGA TGTAGGGCTA AGTGGTTTTG TGCTCATTTA GCTTTCAGGA 

63 01 GAAAATAATT GACTTAACAT TTTG AT AC T A AAACCCAAAG CCTAACAGTT 

63 51 AATTCTTGGT ATTTTAAATT ATTATTGCAA AGATTATTGT GCCGAATAAT 
6401 ATGAAAATAT TTTATATAAT ATTTAAAAAG TATATCTCTT TCTTGGTATT 

64 51 ATTTAAATTA CCATAAAAAT GTGCGAAAAA GTTATACTGA AATG TG AT AG 
6 5 01 GATCTTTTAA AAGTGGTGCC TTGATTTTGT TAAGTGTTAC CTAGTTTTCC 
6551 TCTGAAAACA AGAAACATAC C C AG AAGTTT TCACGAAATG GTCTCATGAA 
6601 TATCTAAGGT TAG TC C G T AG TCTCATCTGA GACAAGGAAA GTCCCTTCCA 
6 6 51 CTATGAGCCT GTAAAATCAC AAGCAAGCTA GTTACTTCCT AGATACAATG 
67 01 GGAGTACTGG TATTGGGTAA AC AC AG C TG T TTCAAATGGG AGAAATTGGC 
67 51 CAAAATTAAT GGGTTACAGG GCATGCAATT CCGAAATCCA TCTGGGCAGT 
6 8 01 CAAATTGTAA AACTCCAAAA TGATXTCTTT TGACTCCATG TXTCACATCC 
6 851 AGGACATGCT GAXGCAAGAG ATAGGTTCCC ATAATCTTTG GCAGCTCTGC 
69 01 CCCTGTGGCT TTGCAGGGTA TATCACCCCT CCCAGCTGCT TTCACAGGCT 

6 9 51 GGCATTGAGT GTCTGTGGCT TTCCCAGGAA CAAGGTGCAA GCTGTTGGTG 

7 0 01 GATCTACCAT TCTGGGGTTT GGAGGATGAT GGCCCTCTTC TCATAGCTCC 

Fig. 2 (cont'd 3) 
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r~< 7\ IPJPPTRT 


n a r* a TTi rr r* t 


PP,APAPATTT 


TPPPPATTPT 


7501 


CTTGGGGATT 


A A P* A prpppPfTl 


L,(_ 1 1\j 1 1 AL, 1 


TSTPPAPATT 
X i\ X X X 


TPTPPAPPPA 


7551 


GCTGAATTTC 


TCCTCAAAAA 


ATGGGTTTTT 


/"»rprprprpf~irp >, t »rp 
L 1 1 1 1 L 1 AL 1 


f^C* A r T 1^ PP n^ PP , a P* 
bbril IbJ-LAb 


7601 


GCTGCAAATT 


TT C TG AAC TT 


TT ATG C TG TT 


m^p p*fnrpfTirp A A 
1 LLL 1 1 1 1 AA 


a a tppp a fppp 

/Vrt. 1 LaLvxtt. 1 L>L 


7651 


TCTAACAACA 


CCCGTCACCT 


C TTG AATGC T 


TTbL 1 bL 1 1 A 


P" A A A rprprpz-inpT 1 

Lj AAA 1 1 1 L 1 1 


7701 


CTGTCAGATA 


C C CT AAATC A 


TCTCTCTCAA 


rrrvrip A P* 7\ prprn 

Gil LAbAb 1 1 


p>p» A O A a A rnpfp 
LLALAAA1 L 1 


7751 


CTAGGGCAGG 


GGCAAAATGC 


CACCAGTCTC 


rprprpP'P'T* A A 7A A 
111 GL 1 AAAA 


pimj TipTi A P" A 
LA X AALAAbA 


7801 


GTCGCCTTTG 


CTCCAGTTCT 


7\ f~*f* 7\ A P«rprp/~i 

L, AGC-AAtj 1 IL 


pmp A 'T x {~* r P^ t (~ , A 
LiLAILl tLn 


ILL bnbnb Aft 


7851 


CCTCAGCCTG 


GTCCTTATTG 


1 1 1A1A1LAL 


i/ilAAnAAl 1 


rprprp/-irp/—i 7, a Ar; 
1 1 lbl Lnnnu 


7901 


C C AT TC AAC A 


AG 1 L1L1 AL 1 


PP* AAA rrprpmr" 

LLAAALi 1 IL 


pp A P» A fprprprpp 
LLALAi X X XL 


f~* T 1 P^ T'P T 1 T 1 f~* T 1 T* 
LlulLl IL 1 1 


7951 


CTGAGCCCTC 


LAAA1 lbl IL 


p* a p , p , p"T i P ,r PP , p* 
LAbLL I L 1 bL 


PTP,ATAPAPA 


PTPPP A A APT 


8001 


TACTTCCALA 


11111 bb A X 


rpp»rprprpmp» A P2P' 


A ATPPPPPPP 


TPTAPTCPTA 

X W X ^W^r X W\J X 


8051 


CCAACTTAC 1 


'piPP TxPLPrpPP 
1 lbl lAblLL 


p- rnmrprpp JpTip 
Vj> X 1 X X V_~ttA— nL 


TPTTPATA A A 

X O X X VT.Pl. X /VfVrt. 


PAP ATAPPPA 


8101 


AbAL 1 bbiUift 


P* 7i A A A A Zl. 7\ P2P^ 


TTT A A TTGG A 


PTTAPAGTTP 

X X /lLrilVJ X X 


CACATGGCTA 


8151 


bbbAbbL 1 IL 


jvpTia fpp a fpr:p 

J\\^,J\f\ X L>rt. X 


P APPAGGP A A 


A AP4PPATTTP 


TT A P ATG ATG 

• XX flvii X Ljix X \J 


82 01 


P'P 1 A P'P' A APTiP 1 

b b AG L AAbAb 


a a a a ttz a nri a 


APATGP A A AP 


GPAGAAATPP 


PTGATAAAAC 


O O C 1 
OZD ± 


71 mppp a ppm 
LAlLbbALL I 


lbl AAbAL X X 


jT\ X 1 l_-riv- X >rtv^ v- 


APTAGPtACAP 


TATGGGTGAT 

X A X V-JVJVj X VjA X 




ALLALLLLLn 


t>p* a r n r PP" a a a t 

lbttX lLAnnl 


PATPTPPAAP 


P AGGTGPPTP 


PC AC AACACA 

L^V^i^L^/VriL^iiL^rV 


o3 bl 


■ m/~ , /~'p' a a nr»(T* a t* 
1 GGG AA 1 1 A 1 


ppp ?\ A »"p A P 1 A A 
L>bb/A>i 1 ALfvH. 


•PTP A A P A TP; A 
X 1 \~r\s\\2r\ 1 on 


PATTTPPPTA 

VJTXi XXX OVJVJ X 


GGGAP APAGA 




bLLAAAL i A 1 


A '"PP 1 A P 1 A TPP a 
A 1 LALA 1 bbA 


TTT^PTT A T A P 
X X 1 V— X X J\ X /t.\w 


X X X X uv. X X X X 


AATAAPAPAA 


8451 


ACAAAAAAAT 


ACATCATTAA 


AAGGTTAGAA 


GTGAGAAGGT 


GTTTTTATGG 


8501 


AAATCAAAAA 


TAATATCACC 


T TAG TG AAC A 


GTATTCTTAT 


GATTGTAGTT 


8551 


GAATTAGAGA 


G C AG AAT AC A 


TC TAG AAG AT 


TCAGTAGTAA 


GCATGTTTCT 


8601 


TCGATTAATG 


GAAAATTTGA 


AT AG C C TAG C 


TGATTGAGAT 


TGAGGTTACT 


8651 


ATTAAATGCC 


TG AAG TAT AA 


GAGTTGGTTG 


TTTATGTAAA 


CAAAATATCT 


8701 


GTTTTACATG 


TACATGTGTA 


AG TAG G AC TG 


TTGAGCCC C A 


GTAACATGAA 


8751 


ATATCAAAGA 


GCATGACTCG 


AAT AC C TG C C 


AT ATG AAG TG 


CTATTACATC 


8801 


AAAAAAGAGG 


CGTGTGCTGA 


AAAATTACCT 


ACAAATGGCA 


TTTTCCTCAA 



Fig. 2 (cont'd 4) 
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8851 


ATCAATTTTA 


AATCTTCAGA 


8901 


TTCAGAATCC 


CTCATCATAA 


8951 


GTATAACACA 


TTTG TTTTC A 


9001 


AGTGAATCCA 


GAACTACAAA 


9051 


AACGTTAAAA 


ATCTAAAAAC 


9101 


CTTGGAATCA 


TTAATTAACC 


9151 


GTTTCGCTAG 


CTACATTTTT 


9201 


ATTATTGTTC 


AGTTCTGAAT 


9251 


TACCACAGTC 


TTCCTGATCA 


9301 


TCCTTTTCTT 


AAATGTTG C A 


9351 


GAATCTGGCA 


TCTTTAACTT 


9401 


ATGTATCAAT 


GGCTCAATCT 


9451 


GATAAATATC 


TTTC TAAATG 


9501 


TAGGATAGAG 


TC C T AG AAAT 


9551 


TTAAAATATT 


TGATACATAT 


9601 


TACATTTATG 


ACAGTATGGG 


9651 


ACATTAATGA 


TAATACATAA 


9701 


ATGTTATATC 


CTTATTAGCA 


9751 


TACAGCCAGA 


GATATTTGGT 


9801 


TGAAAATTTA 


GGTGAGCTAT 


9851 


TATAAAGTGG 


GGATAATTGT 


9901 


AC TG AATTAC 


TATAGAAAAT 


9951 


ACTCTCTCTG 


TATTACTTGT 


10001 


AGCTCAGTTT 


CGTTATATGC 


10051 


TTTCCAATAA 


AATTATTTGT 


10101 


CTTACTGGTT 


TGTTTCAGGT 


10151 


TCACATTTTA 


GCAAACTTTT 


10201 


TGTTTGCAAA 


ATGTAAAAAA 


10251 


TTTATCTTTT 


TTATTGCTTT 


10301 


AAATATATAT 


TTGATGAAAT 


10351 


TTGGAATTGA 


ATTTGGCATA 


10401 


TATTCAAAAT 


TTCTAACAAA 


10451 


ATTGTTTCTT 


CTGTTTCCTT 


10501 


AGACTTAAAT 


ATT AG T AT AA 


10551 


CTATGAATAA 


AAGGCTTTCA 
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ATTTCATTTT AATAATTGTT TAGTTAATAT 
AAAGCAGGCA AAAGGCAAAA GTCCTTGAAT 
AACAAGCCTG CCTCTAACTG TGAATCCAGG 
TTAACTAAGA TTGGCCCCAT CGAGTTACTG 
TAAAAGGCAT GCCTCAACAA TTATTTTCTT 
TATGTGTATC CAAACAATAA TCTTCCAGCA 
AATTAC TTAA TATCATGTAA AATTTGTTTT 
TTTGACATAT GCATCAAGCC ATGCAACTGC 
CTGATCTGTT CTAAATCTCT ATAGCATTTT 
TAAATAAAAC CATACCTTAT GTGGCCTTTT 
AATGCGCTTG AAATTAATCT ATGTCATTTC 
TTTTAATTGT TAAGAAAAAA TGTATGCTGG 
AGTTTTTGTT CACAATGCTG AGTGTTTGTT 
GGTATCACTA GGTCAAACAT TCAAATAATT 
TGCCAAATAA TCTCAAATTT TTTACCAATA 
ATAAATGTGT CTTTCTTATA CCAACTGACA 
AATATTCTTT GCTAATTTGA TGGGACAGAA 
TTTTATTATT GTGGTTGAAT GACTGTACTG 
TCAAAATCCA TCTTCATTAT TTACTGTATG 
TTAATC TCTT GATGCCTTAG TCTCCTAATC 
ACCAATCATA TTAGGTTCCT GTGAGAATTA 
GCTTAGAATG GTATCTAGTC ACCAGGAAGG 
TTATTATCTA ACACGTTTAA TTATTAATGA 
TTGGGATATT TGAAACTTTT CTTAGTGAAT 
CTATTTTTCT ATGGACAAGT TGGTATTATT 
TCAGTTAGTA AGAATTTTAA GGATTTTCTA 
TCTGCATTTT ATCTTTTTTC TTTCAGATAA 
AACAAAAGGT TTCTTCATCA AGTTGGTATC 
GTGATTTGAA AATTCTTGTC CTGAGAACCA 
AGTTCTCTTC TTTT AC TC AT TCTGAAGTCA 
TGATATAAAT CCTAATTTTA TATTTTATGA 
TATTTACTTA ATAATCTAAT CCAGGTTTCT 
TATAATGCTT TTTCTGAAGT TATTTTTCCT 
TAT TAT C ATA GAGGAAAAAA TATCTGTTAG 
TCTTATTGTT GCATTAATAT ATTTAAATGT 

Fig. 2 (cont'd 5) 
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10601 


AGAGAGCATA 


CAGATTAGCA 


10651 


TTGACATGAA 


C ATG TAT AAA 


10701 


GAACTTATTA 


GTGAATTTAG 


10751 


AAATTC C ATT 


TTATTTATCT 


10801 


TTAAAT ATG C 


CATTTACAAT 


10851 


AGGAATTTAT 


AAAATGAAAT 


109 01 


GAAATAAATG 


AAAGAAGACC 


1U J J J. 


TTGAGAGACA 


ATATTGTTAA 


11UU1 




ATGGTGAACA 


11UD1 


c a ii rPT a T"T"T I 


PTATAATTCA 


111UJ. 




GACTGTCTGG 


-i -1 -t C 1 


1 V_ 1 ± k-.rt.VjL,^ 


APATPAPRAP 


"1 1 O 1*1 1 

llzUl 










TP^PPAAPAfiA 


11 j Ul 


21 r* ZlTUPPPAT 
,rtAj,rl x X 


(^TAnTPAGCC 


X X J _> X 




APAPTGPAGT 


1 1 a n i 


aaTA APPAPA 


CCCTAACCTA 


i i / ci 

i X J X 


TTTP,TAP,PPA 


AAGAAATCAT 


X X _> \J -L 


ATPAAAP^PPA 


CAGTACCCTA 


11551 


AAAAAATTTT 


C C C AT ATG AA 


11601 


TTACAACAGA 


TATGCAGATA 


11651 


AAGAAAGAAT 


CTC AGAAGG T 


11701 


GACAAAATTA 


AAAGAGAATA 


11751 


TTAAAfiTGAC 


CAAATATACG 


11801 


AAAGAAAAGA 


TT AG AAAAC C 


11851 


T AAG TC T AGC 


AAGAGTTTTA 


11901 


- CCCAGGCCGA 


TAAAACGCAA 


11951 


GACTGTTTAA 


AGTCAAAGAT 


12001 


AGTGTATGAT 


AACCTATGAA 


12051 


TGGCAGAAAC 


TTTACAGGCC 


12101 


TTAAAGAAAA 


AjyyvAACTAT 


12151 


TCCTTCATAA 


ATGAAGGAGA 


12201 


GAGGTAGTTT 


GTTACTACTA 


12251 


GTCTGGAAAC 


TGGTAGTGAA 


12301 


GA&AGTATAA 


AACTCCCTGG 


12351 


CCACCAGAGA 


AATCTAACTA 



AAGAAAAAGT ATAATTGCCT TTTTTTATAG 
GAAAAACCAA AAAAATCAAT AAAACAACTA 
CAAGATCATA GCATACAAAG CCAAGATTCA 
ACTAACAAAA AATATTTGAA ATTTGAAAAT 
AACATCAAAA TATTGAACAA TAAAGTATTT 
CTCCTATACC AGGAATTACA GACCATTGCT 
AATATATGTG AAG AG ATA CT CATTTGTGGA 
AGTATCAGTA TTTC C C AAAT TAATCAATAG 
G AAC AC C AG A AGATGTTCTG TCGAAGCTGA 
AATGGAAATG CAAAAGGCAG TCACTGCCAA 
GTTCCAGTAG GTTACTTCAC TACTGCCTCT 
AGCTGCCCAG AAGCCAGAGA AACTCCTCAC 
T AC C AG CATC CAGGCAAGCC ACCATCAGCC 
GGTACCACTG TACACTACCC TGGGGAACAA 
CACCTCTGCC ACCACTAGGG CCTGAAGCCT 
CCTCAGCACA GCTTCATCAC AGCTTCTGTT 
CCAAGGAAAT CACAAATGTC ACTGACACTG 
AGAGAGACTA CATTACTGCA CACACCCATA 
TCCAGACAAC ATCACAGGTA TATCTAAAGG 
AGCGAATTCA AATATAGGAA GAAGCGACTG 
AAGCTTCAAC AATATCCTAC ATTCAACCAG 
AAAGACAGGT CTTCTGAAAT AATC TAGTC A 
ATCAAATCCT TCCTGACATT TGGGATAACA 
AATTATAGAT ACCCCTGAGA GTGAAAAGAC 
CACTTAATTA AATAATATAT GAAAACTTCC 
G AT ATTTGG G ATGCAGGAGG CTCAATGGTC 
AAAGGTCTTA TACACAGCAC ATTACAATCA 
AAGGAATAAA TTCTAAAAAC AGCAAGAGAA 
GTAAACCTTA TCAGACTGAC AGCAAATTTC 
AGAAAGAATA GGACAATATA TTCAAAGTGC 
CAGCCTTAAA TACTATAGCC CACAAAATTA 
AATAAAAGGT TTC C C AG AC A CGAAAATGCT 
GACTGGACCT ACAATAAATG CTCAAGGGAG 
AGGACGACAT TTATCATCAT GAAAATACAT 
TAAGCAACTA AAGGGAGGTA TCAAATGTTA 
ACCACAATGA CAAACAATAA GGGAAAAAGA 

Fig. 2 (cont'd 6) 
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12401 


AAGGAACAAA 


AATATATAAG 


ACAACAAATA 


AACAACAATA 


TAACAGGAAG 


12451 


CCTCACATAT 


CAGTAATCAC 


TTTGAATGTA 


AATGAATTAC 


ATTCTCCACC 


12501 


TAAACGTTAT 


GAAATGCCTG 


AATGATAAAA 


CTATATGATC 


CAAATATATG 


12551 


CTGATTACAA 


G AAACTT AC C 


AGGCAGACAT 


ACATAGGCTG 


AAAG TAAAAG 


12601 


AATGGTAAAA 


GATATTCCTT 


GCAAATGGAA 


AGCAATAGTG 


AG C AGG AG T A 


12651 


GCTATACTTA 


AATTAGATCA 


TACAGACTTT 


AAGTCAAAAA 


GAG T AAAAT A 


12701 


AAAAAGACAA 


AGGATG TT AT 


TATATAATGA 


TGAGATTAAC 


CCAGCAATGG 


12751 


GAAATAACAA 


CTCTAAATGT 


ATATGCATTC 


AACACTAGAG 


AACTCAGATC 


12801 


CACAAAGCAA 


ATATTAG AC C 


TAAAGAGAGA 


AATAGACTGC 


AATACAGTAA 


12851 


TAG TGG AG AA 


CTTCAACACT 


CCACTTTCAG 


TATTAGACAG 


ATAATCTAGG 


12901 


CAAAAAATCA 


ACCAGTAAAT 


TTTAGATTTA 


AAC TAG ATTT 


TAGACCAAAT 


12951 


GGACCTAACA 


GACATTTACA 


AAACATTCCA 


TCCAACCACT 


GCAAAATGAA 


13001 


ATTTGTGTCA 


TCAGCACATG 


AAACAATGTC 


CAAGATAGAC 


CACCATATGT 


13051 


TAGGCCACAA 


ATCATGTCTC 


AGCAATTTTT 


TAAAAG TTG A 


AATCATATCA 


13101 


CATATCTTCT 


C AG ACC AC TG 


TTGAATAATG 


C T AG AAATC A 


ATGCCAAGAA 


13151 


TAACGTTGGA 


AACTATACAA 


ATACATGCAG 


ATTAAACAAC 


ATGTTCCTGG 


13201 


TTGATCACTG 


GGACAATAAG 


GAAATTAAGC 


TGAAAATCAA 


AAAATTCTTG 


13251 


TAACAAATAA 


AGATTGAAAC 


ATAACATATC 


AAAACCAGTG 


G CAT AC AGC A 


13301 


AAAGCAGTGC 


TAAGAGGGAA 


GTTTATAGCA 


ATAAATGCTT 


ACACTGAAAA 


13351 


AGTAGAAATA 


TTTTAAAATT 


AGCAACCTAA 


CAATGTGCCT 


GAAGAAACTA 


13401 


AAAAATCAAG 


AACAAATCAA 


ACCCAAAATC 


AGCAGAAGAA 


ACACAAAAAT 


13451 


AAAGATCAGA 


AAAGAACTAA 


ATCAAATAGA 


GACTAAAAAA 


ATACAAATGA 


13501 


TTAACAAAAC 


TAAAATTTGG 


TTATTCAACA 


AGATAAATAA 


AAT TG AT AAA 


13551 


C CG C TA G ATA 


GACTAAACAA 


GGAAAAAGAA 


TATCCAAATA 


AACACAATCA 


13601 


AAAACGATAA 


AGGAGACATT 


ACAACAGATG 


CCACAGAAAT 


AAAAAGGATC 


13651 


ATCAGAGACT 


ATTATTAACA 


ACTATATGCT 


GAAAAATGGA 


AAATATAGAG 


13701 


AAATAGATAA 


ATTCCTAGAA 


ACTTACAACC 


TACCAAGCTG 


TTGCATCAGG 


13751 


AAGAAATAGA 


AAACCTGAAC 


ATATCAGTAA 


TG ATT AG C AA 


AATTGAATCA 


13801 


GTAATAAAAA 


ACATCTCCCA 


ACTCTTTTAA 


AGC TTTGG AC 


CAAATAGCAT 


13851 


CACAGCCTAA 


TTCTACCAAT 


CATGCAAAGA 


AGAATACCAG 


TCTTC TTG AT 


13901 


GCTATTACAA 


TAAATCAGAG 


GAAGGAATTC 


TCTCTGGCTC 


ATTCTACATG 


13951 


ACCAGTGTCA 


CCTTGAAACC 


AAAACCTGAC 


AAGGACACCA 


CAAAAAGAAA 


14001 


ACTACAGGCC 


AATAACCATG 


ATGAACACAG 


ATGCAAAAAT 


CATTAACAAA 


14051 


ATACTGGCAA 


ACGGAATCCA 


ACAGCACATC 


AAAAAAATAA 


TATAC C AC AA 


14101 


TCCAGAGGGT 


TTGTATCAAG 


GATACAAGTA 


TGACTCAATG 


TAAAT AAATC 



Fi 9- 2 (cont'd 7) 
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1 / 1 CI 


AATAAAPA TH 


AT AAGC ATC T 


TCACAGAATA 


TAAGACAAAT 


GAATATATGA 


1 4^ Ul 


tp atptp a at 

X LA X V_ X v_/-\_r\ X 


APATGCAGAA 


AAAAATTTTT 


GATAAATTTC 


AAC ATC TC TT 


14zj1 


P A A A A A A A 


ATrTPTAAAA 


CTC AG CAT AG 


AAGAAACATA 


CCTCAATATA 


14JU1 


at 1 a a apppp A 


T ATG TG AC AA 


ACTCAGAGCT 


AATATCATAC 


AGAATGGGGC 


14 JdI 


AMb 111 AAA 


ft; A fTTTPPTT 

VJJ^T.^, X J. J. V_V_, J. V_ 


TAAGAAPTGG 


AACAAGACAA 


G G ATG C AAAC 


1 4 40 1 


rppfTP 7> PP ?a fP 


p p t a tpp a p a 


TAPTAPTAPA 

X jrivJJ X /A X nun 


APTPPTAGCP 


AAAACAATCA 


14451 


OTvOATiPPTiTA ?A 
LALAALL AAA 


APAAATAAAA 


APTATPTAAA 


TTGAGAAGAG 


CAAGTAACAT 


14501 


nv~* rp rp /~» o rpp T 1 T* 
1L 1 1 LL it- 1 1 


1 LL 1 LA 1 k?A X 


A TPPTTTTPT 
X ijjtjj X X X X \j 1 


ATPTGG A A A A 


TAPTA A A A AP 


14551 


TClAGlAAAA 


ALL 11 — L X ALA 


mrprp/™; ts rprp t\ tv rp 
XXX oA X X /vrt. X 


TA ATTT APTA 
X/\r^X X Inv XA 


A APTTTP A PP 


14601 


ATACAAAATA 


71 7\ 71 tv rp 7a p 7i 7\ 7a 

AAAA1ALAAA 


AL X LAL 1 ALL 


AX X lLlnl uL 


PPP A ATA ATA 


14651 


AAATAGCTAG 


G AAAG AAA 1 L 


AAbAAAb 1 LA 


"PPPPTlT HPT 1 7A 7a 
XLLLAX X XAA 


7\ rpT 1 7\ PPT^Ta pa 

AX iAuLlALn 


14701 


AAAAATTAAA 


A 1 ALL 1 brLLA 


A X AAA X L AAL 


P A APTTA A A P. 


7a TPTPTPP A P 
A X ^ X ^- X uLr\L 


14751 


AAAACTACAA 


AAL AL I LtA 1 La 


7A 7a 7A C 7a 7a 7a T^T* 7a 
AAAvjAAAX iA 


A PP A TT A A A P 


A A A P A A A TTP. 


14801 


AGAAACATCC 


LAIbl 1 lAlb 


C~i 7A rp/"' 7A 7a 7a 7a C 7a 

LA 1 L AAAALA 


7a TT A A T A TP A 


TT A A A A TP A P 


14851 


CATACTTCCC 


AAAbLAAl I 1 


OP 7A P 7A rprpP' 7a 7a 
LLALAX X L AA 


rpppTi A T u T"T , PT 
XL?^— AA X X X L_ X 


Zi PP 7a A 7\ rprp t\ p 
/a.V„ V t\l\J\ X 1 AL 


14901 


CAATGTCATA 


11 ILAxALAA 


1 1 AL AA 1 AA 1 


PPT 1 7a A 7a 7A r PT , a 
LL X AAAAX XA 


prp t\ rp/™i/-i 7\ 7\ rpf 


14951 


AGAACAGAGC 


LLAAAiAbLL 


7a 7a 7a PP 7a 7a T"TP 
AAALLAAX 1L 


TP! AAPATAAA 


P. A AP A A ATPT 


15001 


GGTLLl bAL 1 


rp 7\ 7s rp/™' 7V prp 7\ rp 
1 AA 1 L AL 1 A 1 


PPTa 7a TT^T 1 2t TTZ 
LL AA X L X A X vj 


P ATPTA AP A A 


A A TTP A A P A T 


15051 


GGATTTTATC 


AA1 1 1 Li X ALA 


ATaT'TaTaATaAAA 
AAX AAAAAA-ft. 


TP.T A A A A A A A 


PA APA A APPT 


15101 


bbAbbt- 1 A I A 


pmjv ppp 7a 7a 7A 7A 
L X ALLLAAAA 


PSPPiTPPTA 


TTTTTAP AP A 

X X X X X /-lo-rl.\_^i. 


A A TPP A ATPP 


15151 


AA 1 AbAAAbC 


rpP 7A P 7V 7i 7\ rp 7\ A 

1 LAbAAA X AA 


AP.PP AT AT 1 AT 


ATATATTHTP; 


TPTPTPTPTP 

X X V3 X VJT X VJJ X VJ 


15201 


ib iblAl AtA 


LAL A X AUKI 


TATATATA AT 
X A in X f\ ±J-\J-\-X 


PTPTAPATAT 

V5 X VJ X rAV^A X jTV X 


A ATPTTTTPT 

ArV X \J X XX X \_ X 


i J/ jl 


ALA 1 L i IblA 


7S rp 7\ rprprn tv rp 7\ rp 
AXAX X XaX/\X 


TPP A TTPP A T 


TATAPATA TT 
x ^i. x x .r\ x x 


PP A TTTPTPT 




A1A1 ALL 1 lA 


T A T* A CZ A A TTP. 


GAAGAPTATP 


TGPPATTAAA 


AAGAATPAAA 


n c T c 1 

id J j! 


T i r'r ,r rr ,r PP 7a *T ir P 


rppp 7\ r^p A AP A 


TPPTTPA A AP 


TGGAGTTCAT 

X VJVJ^iVJ X X X 


X xi. X X X ixiivj X 




pa ta tat a atpt 


AHHPAPAAAA 


AGATAAATAT 


CACATGTTCT 


C AC TT AT ATG 






TAAPTTfiATT 

X fiftv, X X VJ-fT. X X 


APATGAAGGT 


GGAGAATGGA 


AAGG T AGG T A 


ID jUI 


r*r* 7a?a2ip7aP7ap 


7i PTPP A A APP 


ATGAATGGAG 


GGTAGGAGGG 


AAGGTGAAGA 


15551 


GAAGAGAGTT 


AAAAGGTGTA 


AACATATAGT 


TAAAAGAAAT 


AAATTCAATG 


15601 


CTTGATAGCA 


GAG TAC AG TG 


ACTACAGTTA 


AC AAAATG T A 


TTATACTCAG 


15651 


GTGATGAACA 


CCTAAATACT 


TG ATC AC TAT 


GCAATTATAT 


ACGTGTAACA 


15701 


AAATC AC TAT 


G C AC TAT AT A 


CGTGTAAAAT 


TAAATGCGTA 


CAAATAAAAA 


15751 


TAATAAAATA 


CTAATCCAGT 


ATCATTCACT 


GACAATGTTA 


ACTCAGGTGG 


15801 


AT AGG C ATT A 


AGTCAATACT 


ACTATAAGAA 


CCACTTCTTG 


TTTATGTTAA 


15851 


TGCCATATAG 


AATGAAATAA 


AATTCACTAA 


AATCCAAAAA 


ATTAGAAAAA 


15901 


CTATCAAAAC 


TCAATAATAT 


TAAGACAACC 


CAATAAAAAT 


GTGGTCAAAG 








Fig. 2 


( cont 1 d 
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15951 GATTTGAACA TACATGTCAC CAAAAAATAT ATTCAAATTT CCAATAAATA 

16001 CATGTAACAA TGTTCGACAT CGTTAGTCAT CAGAGAAATA CAAAATAAAA 

16051 TGGTAATGAG ATACTACTAG ATAGGCTTTT AC AG AG AC TG ACAATACCAA 

16101 GTATTGACAA GGATATGGAG CAACTGAAAT TCTCATTCCT TGTGGTAAGA 

16151 ATGTACAATT ATATAACCAC ATTGAAAAAA CAAGTTTTCA GTTTCTTTAT 

162 01 TCACCCAAAA TATATGTCTT TTGGAAAAAA TTTTTTCCAG TCTGTGGGTT 
16251 GTCTTCTCAT TCTCTTGATA TATGTCTTTT CAAAGAGGCT GAGCTTTACT 

163 01 TTAGACAGTG GTCATCAAAG TGTGTATATT TGTGTTTTTA TAATTTATAT 
163 51 GCATATATTC CTGTGAAAAG ATACTGTATG CATTGTTCAA CATGTACAAA 
16401 TATAAGAAAG ATATAGTAAA GAAATATATA TTTCTAAATT TATAAATGTA 
16451 TTTATTGGTG TTCCACGTTG CAAACTAAAT AATCTACGTT GGCTAATTTA 
16501 AGGAATTAAA C TAT AG TAG A AGGTTCTCAT TTATTGGGAT G ATT AG AAC C 
16551 AGCCTTTTTG CAGGCTATTA GCGAATCATA GCACTAGGGC TTCACTGCTA 

16 601 CCTCCACTGA CACCTCTGAC ACTTGAAACT TGAGGCCAGA TATCTGCCCA 
16651 TGCTGA'I'AGA AAACAACTGA ATAATTTAAT TTGCTAGATA ATAGAAAAGA 
16701 ATCAAATGAC TCTGCCACAT TGCTTGCCAG AAGATTGTTT TTCTCATTTG 
16751 TGACCTCTTG CCTATAAATG ATAGATAGTC CCTGTGCTGC ATGCTATAGG 
16801 TGTTCGTAAG AGAGTCTGGG AATGTGAGCT TTTTATATCC TATTTTTGGG 
16851 TGGTAAAGGT CATTCTATTA GTCTGTTCTT AAACTGCTAA TGAAGACATA 
16901 CCCCAAATTG GGTACTTTAT GAAAGAAAGA GGTTTAATTG AC TC AC AG TT 
16951 CAACATGACT GGGGAGGCCT AAGGAAAGTT ATAATCATGG GGGAAGGGGA 

17 001 AGCACACATG TCCTTCACAT GGTAGCAGGA AGGATAATGA GTAAAAGGGG 
17 051 GAAAAGCCCC TTATAAAACT ATCAAATCCC ATGAGAACTC ACTCTCACAA 
17101 GAACACAATT AGAGTAACTG CCCCCATGAC TCAATTACTT CCCACCAGGT 
17151 CCCTCCCACA ACACATGGGG CTTATGGGAA CTACAATTCA AGATGAGATT 

172 01 TGGGTGGGGA CACAGCCACA CCATTTCATT CCACCTCTGA CCCCTCCCAA 
17251 AT.CTCGTGTT CTCACAATTC AAATACAATC ATGCCCTTCC AACAGTCCCC 

173 01 CCAAAGTCTT AACACATTTC AGTATTAACA CAAAAGTCCA AGTCCAAAGT 
173 51 CTAATCTGAG ACAAGGCAAG TCCCTTCTGC CTATGAGCCT GTAAATTCGA 
17 401 AAGCAAGTTA GCTACTTCCT AGATACAATA GGGTCACAGT CATTGGGTAA 
17 451 ATACACACAT TCCAAACGGG AG G AATTG AC CAAAACCAAG GGGCTACAGG 
17501 CCTCATGGAG GTCCAAAATC CAATAGGGCC ATTGTTAAAC CTTAAAGTTT 
17 5 51 CAAAATTATC TCCTTTGACT TCATATCTCA CGTCTAGGTC ATGATTATGC 
17 601 AAGAGGTGGG CTCCCACAGC TTTGGGCAGC TCTGCCTCTG TGGCTTTGCA 
17 651 GGGTACAGCC CCACTCCAGG CTGCTTTTAC AAGCTAGTGT TGAGTGCCTG 

Fig. 2 (cont'd 9) 
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CAGCTTTTCC AGGCACATGG GTGCAAGCTG TAGGTGGATC TACCATTCTG 
TGGTCTGGAG GATGGTGGCC TTCATCTCAC AGATCCACTA GGCAGTACCC 
CAGTGGGGAC TCTGTGTGGG GGCTCTGATC CCACATTTCC CTTCCACACT 
GCCCTAGCAG AGGTTCACCA TGAGGGCTCC ACCCCTGCAG CAAACTTCTG 
CCTGAACATC CAAGCATTTC CTTACATCCT CTGGAATCTA GGCGGAGGTT 
TCCAGACCTC AATTGTTGAC TTCTCTGCAA ATGTAGGCTC AACACCCCAT 
GGAAGCTGGC AAAGCTTGGG GCTTTCACCT TCTGAAGCCA TGGCCTTAGC 
TGTACCTTGG CCCTTATTAG TTAAAGCTGG AGCAGCTGGG TTGCAGGGCA 
CCAAGTCCCT ATGGTGCATA C AG C AGGGGG GCCCTGGACC CAGCCCACAA 
AACCAATTTT CCCTCCTAGG CTTCTGGGCC TGCGATGAGT AGGGTTGCCA 
CAAAACTGTC TGACATGCCT TGGAGACATT TTCCCTATTG TCTTATTAAG 
ATTTGGCTCA TAGTTACTTA TGCAAATTTC TGCAGCAGGC TTGAATTTCT 
CCTCAGAAAA TG AG TTTTTC TTTTCTATGG CATCATCAGG TTGCAAATTT 
TTAAAACTTT TATGCTCTGC TTCCCTTTTA CAATTAAGTT CCAATTCCAA 
ACCATATCTT TCTGGATACA TAAAACTGAA TGCTTATAAC AGCACCCAAA 
TCATATCCTG AACACTTTGC TTC TC AG AAA TATCTTCTAC CAGATACCCT 
AAATTATCGC TCTCAAGTTC AAAGTACCAC AGATCTCTAG GGC AGGGGCA 
AAATGCCACC AGTCTCTTTG C T AAAGC AT A AC AAG AG TC A CCTTTGCTCC 
AGTTCCCAAC AAGTTCCTCA TCTCCATCTG AG AC C AC C TT AGCCTGGATT 
TCATTGTCCA TATCATTATC AGCATGTTGG TCAAAGCCAT TC AAC AAG TC 
TCTAGGAAGT TTCAAACTTT CCCACATCTT CCTATCTTTT TCTGAGGCCT 
CCAAACTGTT CCAACTTCTG CCTGTTACCC AGTTGCAAAG TTACTGCCAC 
ATTTCTGGGT ATCTTTACAG CAGTGCCCCA CTCCTGGTAC CAATTTACCA 
TATCCATTTA TTCTCATGCT GATAATAAAG AC AT AC C C AA GGCTGGGTAG 
TTTATAAAGA AAAAAGAGGT TTAATTGACT CACAGTTCAG CATGGTTGGC 
AAGGCCTCAG GAAACAGAAT CATGGTGGAA GGGAAGCAAA CACATCCTCC 
TTCACATGGT GGCAGGGAGA AGAATGAGCA AAACGGGGGA AAAACCCTTA 
TAAAATCATC AGATCTCATG AGAACTCACT CTCTTGAGAA CAGCATGAGG 
GTAACCATGT CCATGATTCC ATTACCTCCC AACGGGTTCC TCCCATGACA 
CG TG AAG AT T ATGGGAACTA CTACAATTCA AGAGGAGATT TGGGTGGGGA 
CACAGCCAAA CCATGTCAGT CATGATATGA GAAATTATCA AATTAAGATG 
TAGGGAAGGT TTTTAAAAGA TTTGAGCAAC CACAAATGAC AGATATGTGC 
TATAGTAGTG CAAAATACCA TTTTGCTCTT ATTAAAAATA TAATTGTTCT 
TGATAATCTG AATTATAAAT GTCATGGATA ATTATGATGC ATTATGCTCT 
CAGCAGCTAA AACTTCAAGC AAAATACACA CCTAGAGAGC AATCAGCCTT 
AACAATAATT C TAT AAA TT T AATTTTCTTT ATTTCTGATA ATTACATTTT 



Fig. 2 
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TAAATACATT ACCATTATTT TGGACTTATG 
ATATGATGTA GCTCTTAAAG TACATATAGA 
TCACCATTTC TTTGTAGTTT GTGCTTTCAT 
TACAGATTAT TTG C AG GAG C CTTTTAAATC 
CTTAGCTTTT GTCTGTATAT AATGTGTTTA 
TTG AG AG AG C ACAGGTATGG GTATCTTTAC 
ACCAATTTAC TGTATTAGTT TATTCTCATG 
TCACAATAAA CTGAGAACCA GCTGGTAAAT 
TTTCATTGTG GAGTTCTCAT TTTCCTTAGC 
TATTTCTGCA TAATCTAATG CATTCACTAA 
TCCACATGAT ATTAATACAG CCTATTTAAT 
AATAAATAAG TAGTCATGTG CCACAGAATG 
TTG AAGG AC C ACATCTATTA CTGTGGTCCA 
TTTTCCTGTA CATTTTCATT GTTCTGATAT 
ACCATCGTGT TAGAGTTGCC TGCAGTATTC 
ACCTAGGAGC AACAGGCTAT ACCACATACC 
TACCATCTAG GTTTGTATAA GTACACTCTA 
AAATC AC C T A ATGATGCATT TCTCAAAACA 
TATGTAACAA TACAGTTAGT ACAATATGTA 
TTTTAGCTAT TTCTCTTATA TTTCAAATGG 
GCACGGGCAT TTCATTTTAG TAACCACAGT 
GAAGGATGTT GAG C AAGG TT ATGACATGGC 
GTGACTCCAT GTTAGCAGAT AAAGTTGTAT 
AAGG CC AG AT AAGAGGATAC TGTATGTTAT 
ATGGCAGGAG AGATGCTATG ATTGAATGAA 
AAAGAATAAT GAGACTCATC CAATAAGACT 
TTCCTAAGCT AGGAGGAAGA ATGAGGAATG 
AGCACAAGTT TTTGATTTTT AGAACAAAGA 
TGATTCCTGG TTGAATTTTT AAGGATAAAA 
CTTTCCTGAT ACCATGCAGT ATT TG T AT C A 
CACATTCTTC TTGAACCTGG ACACTGCTCT 
TATCAGATAA GGTTATTCTT ACGTAGACCC 
TTTCCCAAAA TGTGAAATAA GTGACTTTTC 
ATGTCTTAAT AATAAATTGT GAATGTTGCA 
ATTG TG CATC CTGTGTTTGA TTCACTGCTC 

Fig. 2 (cont'd 11) 
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TGGGATGACA GCAGTTCAGT GAGCAGTGGT 
CATCAGCACT G ATG AC C TG A ACACCACATC 
ACATCACCGT CCCCTCTAGG AAG AAT ACT C 
CTTTTTCCAG TGTTTCTGCC AGCTTTTTCC 
ATTAAGGTAT AGCACAAGCC CTTAATCCAA 
AATG C AG AG A TAATAAGGAC TCCCTTTGCC 
CTTTCATCTT AGTCTTTCCT AAAGCCACAA 
GTGCACTGCA GCCAATTTTG AATAAACAGA 
ATATATATAT ATATATATAT ATATATGATA 
ATG T AAT AT A TGTGCCATAT AGCCTGGTGG 
TATTTATTTA TTGTTAATAT ATAGAGTATA 
AGATATTTAT ATATATTAAA TATCTATTTA 
ATTAAATATA TAAAAATATA TAACATATAA 
TATATAAATA TATATTTATA TATTTAATAT 
TTTGTATAGA TAACTACACC ACCAAGCTAT 
TAT AATG TAT AATTCTATAT TAATATAATA 
ATAATATATA TTC AATTG AT TACAATCTAA 
CATATCTCTC C TT AC AAT AT CGATATGTTT 
TTTTCATAGT CTAATTTTAG ATAGTTCTTG 
GAAATATATC ACTGGATCTG TTGTGAATGA 
GACATTACAT CATTAAGTTC TAGCTTGTCT 
GATATAGAAA ATTTCTACCT TTCTGTAGCG 
TGTATTTATT CTGAAATTCT CTAATATCTG 
AAAATTAATA AAATGTAAGT GAAGGTAAAT 
TATCCAGTTA TCTGAAAGTA CATTTCATTG 
ATTTTTCTAC ATAAAGTTTT T C TG T AAT AT 
AGATTCAGAG AAACGCTCCA CCACAGACGA 
AACTGAAAAA ACCAGAAGAA GATTTTGACA 
AAG TG G AAG A CTGTGTCCTC TGGACTTCCT 
GCAGAAAGCT TCCCTGTCTG TTTCACAGAC 
TGTCTGCCCA AGGAGGGGCG CCATCTAGGC 
CTCAAAACAC CCGGTAGGCT TGTCGTTTGC 
TTTACTTTAT TGTTTCCATT CAATCTTTGT 
CTAAAATACC AAATTCTTAT C C AT ATT AAA 
TAGTTTTTCT GTCTACGTTT CACAAACTCG 
C C T AT A TC T A GGCATTCTCT GGAAAGCCTC 

Fig. 2 (cont'd 12) 
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GATGCCTTAC CTTGCCAGAG TTATTTCAGG 
GTTGCTAAAG CAACAGTTAA GTTTTTTTGT 
GAAAGTTTGG AATTTTTACA CTAAATATGC 
TCTGAGAGGA ACAAGCAATG ACACTAATCA 
AAATATTGTC TG TG T ATT AG ACTTCATGAA 
TTCAAAATTG TGCCATACTT TTTTTAAAAA 
AATAATGCCT AATTATAAAT AG.TGCCACCT 
ACTCCAAATA TAATCATCTC AC AG AT ATTC 
AATTAGCTTG ATAGGGTTAA GTGTTACCTT 
ATAAGACATT AAC AAATAG C AAAACATATG 
TCTCTCATAA TGTTTCTTCT GACAGCCAAA 
GTCCTCTCAA TATATGAGAT TTTTGATCTA 
GAAATAATAT AAAGAAGCAT TCACATTTTA 
GGTGATATCA AAC TC TAC AG GCACATATAT 
AGGGAAAACC GATGATGCCA AAGCTTCTGA 
AAGGATCATC TCTACAAAGA TCTCCTTCAG 
GATGAAGGGA AAAAGCCCCC CTCAGGCATT 
CTCCTTTGGC TTTAAGAAAC CAAGTGGAGT 
CCAGCAGTGG AG C AACC AT A ACAAGTGGCT 
CCAAAATCTG CTGCCATTGG CGGGAAGTCA 
TTTGGACGGT TCACAGAATC AGGATGATGT 
AG AC TAC C C T ACAATATCGC AGCTTGCCCC 
AGTGGCATTC CTGGCCGAGG AGGCCACAGA 
TTCCAACGTC AGCAGCAAGT CTGCTGGGGC 
AAC C AAC T AA AATTGGGTCA GGGCGCTCGA 
ACAGACAAGG AAAAGGAAAA AGTAGCAGTC 
TTTGTCAGGT TCCCCCAAAT CCAGCCCCAC 
CACAAGGTCT CAGGCAGCCA GGATCCAAGT 
ACATTTCGAA GGTAAGGATG TATAAAATGA 
TAAATATGTG TTAGACACAT ACATTACATA 
TTTTAAATAT GTATAAGGTA TATAATATAT 
GTACACAGTG AGCTCTATGA AGCTTATCAT 
ATAGTTCTCA TTTTGAGAAA CAGTCAAACT 
TG AT AC TAG C AACACAAACA TCTAAGAGAC 
CTAAGTGTAG TGATAAGGCA AAGTAATAGC 

Fig. 2 (cont'd 13) 
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24801 TTGTAAAATT TCTATAGATT 

248 51 AACCAAATAG GTTTTCATGA 
24901 TTGCTGGCTC TTAATGAGTT 

249 51 CCTTGTAACA GATTCCATTG 
2 50 01 CATTAAAGAC AATCAAGATA 
25051 GTACAGTATC ACAACTGAAG 
25101 CTTTCTTTGA CATGACTCTG 
25151 GATTGTTTTT GGTGTTAAAT 
2 52 01 TTTTAAATAC CCATGTGTTT 
25251 AATTGGGGAC AAATATCTAA 
253 01 AATGG T AG AG TGTGGTTTAT 
253 51 TTGTTTTTAT TCTGTTGACT 
25401 AAGATTTTGA AACATTTATT 
2 5451 GTTTGAAAAA CCTAAAGGCT 
2 5501 GAACTGTCTT TTTAAAATAC 
2 5551 GTATAGGGAG AGTAAATATT 
25601 CTCCTGTGGC CCATTACCTA 
2 5651 ATTTCACTGT AGGAAATATG 
25701 ATGCCATTAC TTCATATCAT 
2 5751 TACCATCTGC TTCATTTACA 
2 5 801 AAATCTAACA GCACCAGCTC 
2 5 851 GAATAGCACT GGCAACTGAC 
259 01 TACAGACAAA CTGACTTGTT 
259 51 CTGCTCCCTG ACAGTGACTC 
2 6001 AACTTGCTTT AAGCGTTCCA 
26 051 TAAAAATAAA AAAAAAAGAG 
2 6101 ' GCGTAGAAGT TGTTTCATTA 
26151 ATGGTCAAAC TACTGACTTT 
2 6201 GAAAACCAGG GGTTTTCATG 
26251 CTCTTTCCCT AGTAGCAGCT 
26301 TTCTGCACTT GATAAAGAAG 
2 63 51 GACAGGAAGT GCTTTGTAGT 
264 01 TACTGTAACC CGAAATATTA 
2 6451 AAGTTGTTAT TTGTTTTTCT 
2 6501 AAATCTGCCT CTGCACCTAA 
2 65 51 GCCCAGCCCT AGTACCACAT 



TCCATTCCTC CTTTTCACAT TAAAAATTAA 
C TTTTGGC AT TCATTTCCAG TGTCATTTTC 
GGTGATCATA AATGTAGATG AAGTTGTTTT 
GACAGATTTA TACAGTGTCA T ATC TTG AC A 
TGACATAATT TGAAACTATT CCAGTGTTTG 
AGTGGGCTAA GCTTTCTAAC TCTTCATCTG 
G TAAGG ATC A TGACTTGGTT TCTGTTCCTG 
ATGTGAAGTT CTGCTCTAAG ATATCACTGT 
TTAAGTGGTA GGAAAATAAA TGCAGTTAAA 
ACCTCTCTGA GTCTGTTTTC TCATCTGCAA 
AGTTCATTAT GGGTTCAATA TTTTTAATGT 
AAACCCAGAA CTTTGATATC TTGGAAAGGA 
TTACAATAAA GCAATTTCAG ATACCTGATT 
TTATTCCTCC G TAG T AAT AT TAATGCTGCA 
TGATTCTCAT TGGGAAGAAT GAATTATGGC 
TCTGTTTCTT AAGTAAAAGC CAATAGTGCC 
TGAAACAATT TCTCATATTC GTCATAAAAT 
GATTTCATTG CAACTCAATT AGTAATCATT 
TGTATTTCCA TATTTACATA AATTTGATTC 
AAACTAAAAT GTTTTCTGAA CTAAACTCCA 
TGTTTCAAAT C AC T ATT AAA AAATGTATTT 
ATAAAACCCT TTGGCCTCTG CTGGGGAAAA 
G C C G AC AAT A TCAATATTGT TTCCAACCAA 
AG AC C AC C AG ATACTCAACA CAACTCCCTA 
TCTAGATTTT GAATAAACTG TTTAAAAATT 
AAGAGCTCAT TTAAGTGTTG TCTATCGAAT 
TAATGGTTCT GTAAATAGGT AACAGCAAGT 
GAGTGAAAGT CTCATGATCA CTTAAATTAT 
TTTGACTTAC TTTTGTTCCA CCCACTTCCC 
CAGTACTGAC CTACCCTTAT ATGAGAGATT 
TCCAAGCTTA TAAAAGTTCA TTAACATAGA 
TCAGTACATC AAAGCACACT TGGCTCTGTG 
AATGTGGATA TTAGCTTCTT GGAACAACTG 
TTTAGGTTGT TTGGTGCCAA GGCAGGTGGC 
TACTGAGGGT GTGAAATCTT CCTCAGTAAT 
TAGCGCGGCA AGGCAGTCTG GAGTCACCGT 

Fig. 2 (cont'd 14) 
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GGCAGTGCTG GTGGGCTAAG CGGCAGCAGC 
C TC AG AC TT A ACTACAGATG TTATAAGCTT 
GCCCAGCATC GGTTCACTCT TTCACATCAG 
AATATGAGCA GTTCCTCTGC AGGCAGCAAG 
CATGACTAGC CTCCACACGA GCTCTGAGTC 
ATCATGGCTC CTTGTCTGGA C TG ACC AC AG 
CTGCTCATGA GAACGGGTAG TGTGAGATCT 
TTTCCTGGAG GCATTGATAA CATCTTCCCC 
AACCCCCACC CCATTAAATT CCCTTGATTT 
AAAAAG ATGT AAGACTGATG AAACCGGGCC 
AAATTTACAG AGGAATAGAA TCATTAAAGG 
GTTAATATGA ATGCATACAT TTATACCCAG 
CAAGGAATGT ATTTAGATAT TGAATGAGGT 
ATTTGGCTTA AGCTTCAGAA CAACACTAGC 
TTTTGAAAGA CCAAATTGAA ATTATTCTAT 
GTTAAATTTG TTCCATTTGT ACTTCTATGA 
TGGGAAATCC TAGTTGGCCT GGTTAAAAAA 
GCCATGAATC ATTAGAGAAA ATTATAGTGT 
TGCTTAAAAA GTAATCATAT TACAAAGTGT 
ATATATAAAC TT AAAG TAG A GGACATAGCA 
GCTTACTGTG AAGCATCCCT TTTGAGCAAA 
C AAAG TG ATC CTCTCTTGAT TATACTGTAC 
ATG TCTT AAA ACCACTTCTT TTTCCTGATA 
TTGACATAAG TAAAGCTTTA AACATGGTCT 
TACTGGTCAC GTAAAATACC TGATATATCT 
TTTTTATTTT TTTTTTATTT TTATTTTTTA 
ATTTATTTTC TAG AC C TTC A GCCTTACTCC 
TCAATTAAAG CCCTTAGCTA GACACTCTCT 
TAATGTAGGA TGTCCCAATT TGAAATTCCC 
AAATACAAAA C C C AG AG C AA AACATTGCTT 
CTTGCCTAAC AATGAGACAG GGACAGCCGT 
AT AAAG T AAT TTTAACACTA ACTAAAATAT 
TGCTAATTAC AAAATACATT TTTCCTCCTA 
GCAAACAGCT CAGTGATAGC AAATAAAATG 
TGCTGTTATA TGCTATTATA G AC AG CAT AC 

Fig. 2 (cont'd 15) 
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283 51 GTAAAGACCA GTAAGGGTTC ATTTTTCCAC CTAAAATGTC GGGCTTCCTG 

2 8401 TAAAATCTTT GATTCTAGTT TCAGCACTTC TAAGGTAAAT GGGCATCTTC 

2 8451 ACATGTCATT TATAAAACTT CTAATGAATG AATTATATTA AAATAGATAA 

2 8501 ACAACCTATA GTTTTAATGA ATGTATCCTA GATTGTATGC TCATATGTAA 

2 8551 GGATTCTAAA TATCAACTTG ATAACCAAAC CAAACATAGT GCAAATAGGT 

2 8601 TATCATTTAT TAACCACAAC CACCTTCCAC AAAACTGGTC ATTTTTTAAT 

2 8651 TATTAAGATA ATCTGCAACA AGTTGGCCAT TTAGCCATCA GCCTATTTCT 

2 8701 TCAGCATTTA GACATTAATC C C AG ATTC AG AAATAAAGTC AAGTAACTAT 

2 8751 TTATAACCAA GTAACATTCA AATCAAAACT AGATGAAAGA TTGGTTAGTT 

2 88 01 GCATAGCTAT AACCAAAATG CAGTTTTAAT ATTTTACTCT AATCTATATT 

2 8851 TTAACTGAAG TCAATAAAAT TTTCACTATG GAAATACACT AGAAAATATG 

2 8901 CAATTTCTTA TTCTTTTTAA GCAGATTTAT TTATTGTACA TGTTCAGTCT 

2 8951 TTGAAATAGG CCAATTTTAT TTATGTTATG TTATGTTATT TATTTGTTTT 

29 001 GAAATGGAGC CTCACTCTGT CGCTCAGGCT GGAGGGCAGT GGTGCCATCT 

29 051 CAGCTCATTG CGTCCTCTGC TACCCGAGTT CAAGCAATTC TCATGCCTCA 

29101 GCCACCTGAG TAGCTGGGGT TATAGGAGCG GACCACCATG CTGGGCTAAT 

2 9151 TTTTGTATTT TTTGTAGAGA TGACGTTTCA CCATGTTGGC CAGGCTGGTC 

29201 TCGAACTCCT GACTTCAAGC GATCTACCCT CCTTGGCCTC CCAAAGTGTG 

2 9251 GGGATTACAG GTGTGAGCCG TGGCACCAGC CTGAAATAGG C C AATTTTT A 

2 9301 AAATGGGAGT ATTCCTACAT TAAAATGGCC AAATAAAGAC TTTTTCTAAA 

2 9351 ATAAACTTTA AACTAATTTT GGATAAATAT GTTTTGCCTT TG AGCCTTAA 

29401 TAAAATGCAT TAATGAATAT TAAGCTGTAA AAAGTACATG TT AAC T AC AT 

2 9451 AGCTATAGTG TATAATATTA ATATTAATTA GTGCCTTCCA GTAAATTACT 

2 95 01 AGATTAAAAT AAATTTTAAT AT AAG AC AC T GAGCTTTTTG TTTTC TTG AC 

2 9 551 AATAGAACTG CAAGCAATAG CAAATTGCTC TAATCCTTTC ACGTACATTT 

2 9 601 AAGAAAGTTT ATGACCTATT GAAGAGAAAA GTAGATCTAG TGGGTGATAC 
2 9 651 - TGGCTTCATT ATGGTTAATT AATTGATCAG TAGAATGTCA GAAATGCTAA 

2 9701 GAAAACCAAA G AAC T AC AC C AGAGAGAAAA TGTGTTAATG TAAATTTTAA 
297 51 GGCAAGTTAA T TAG C GAT AT ATAATAAAGA TGTATATAAG TTCATGATTT 
29 8 01 ACCTGTTTGT CTACAATTTT AGATGATTTT TTGATACTCA TATTTAAATC 
29851 GGTAGCTTTT CCTAT-AGATT TTAATTTTTG TTTAAATTCC TCTTCGTTAA 
29 9 01 ATTAAATAAA ATAATAAAAT ACACTTTTTA ACAGTTTTCT CTTCTGCAGC 
29 9 51 TGCTCTAGGT CATTGGTGGC CATTGAGCCA TAACTAGTCT ATATTTGTTT 

3 00 01 TGGGTTTTGT TTCATGTGTC TGACTCAACT AAATTTTTAA ATAATTTGTA 
3 0051 GTAACCAACT TTGCAAATTC TGGGTTTGTC TTTAAATGTC AGATCTGGCA 
3 0101 ACGCTGCCTT GACATTTCTG CCTAGAAACT ATTGGCTCTA GGCAGTCAGT 
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TAG TTGG C C T 


TTTTCAATGA 


31601 


TTATTTTTTA 


TAG TTC AG AG 


31651 


GTTTTGTTTA 


CATTATGTCC 


31701 


AGCTTGACAG 


AAATACACTA 


31751 


TCAGCACAAT 


TGCTACCTCT 


31801 


CTCTTCCTTC 


TTTGTTTGTT 


31851 


ATCACTTTTA 


ATTTTGATGG 
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TGACTGAAAT CCCCATTCGT TTTCATGCCC 
ATGAGTTTGC AACCTTTGGT GATTTG C AG A 
CATTAATATC TAGATTCAAA CATATTTCTA 
TGGTAACTTT AAATGTATTT ATTCTAATTT 
ATCATATAAA TATATAATTT TT AT AC AAC T 
TACCAACTTT TCATATTCTT TGTGGCATTA 
GAAATAAATA ATCCTTAAAA TGACTTATGA 
ATGTGGTTCA TATTTGGAGA TTTCTCATAT 
TGTTTGTTTA TCCACAGTAC TTAAGAAAAC 
CTGTAACTGG CCTCTACACA GTATAAGCAA 
CGATAAAGTT AAAGTTGTAT AAAGCCTTTG 
TAAATAATGG AGTACACATA GAAGAAAACA 
ATCAAATTTT AGGTCAGCCT TTTTACATTC 
TGCGTATCCT GTGTATTTAA TTTTGATCAT 
AGCTATAATT CTGCAATTTT AATTAATTTT 
CCAGAGATTA TAAAAGAAAT CCCTAAACTT 
TCCTCTTCCT GTATTTTTAG AGAGGCCATT 
GCTTTTCATT CCTTCTTGTT ACCAATTGTG 
ATAAAATTTA TAG C C TTTTA AATACCTAAT 
TATAGCTTAA TTTCATTAAA ATATTCAAAT 
AAGTTTTAAG CAAACATTCA TATATCTTTC 
ATAAACATGC CTTTTTATTA AAAATAATTT 
AAACAACGTT TTATACCATC TCTGTCAATT 
AGCCAGAGCA GATCTATCTC ATTTTGCATG 
AATCAGTTGC ATAGGGAAGG AAAAACTAAG 
TTCAGAGGAA AATTATGCTA ATGTGATTTA 
ATCACTAAAC TGTCATTGAA TTGTACTGCA 
GTGTGACGGC AATGGACATC TTGTCACCTT 
GTTAAGCATT ATATGTGTGT TACCAAAAAA 
AACCATTTTT GTTGGATGTG TAATTTGGAA 
TTAGGGGTTT TCTTTGTTTT AACAGCATGC 
CCCAAAAAGG GACTAAGGTA TATATTCCTC 
CTGTTGTTAT GTAAACTTTG TGTGCTGTCT 
TGCAATGTAG CACATGACAT TGAGGACGAA 
TTTCTCTGGC CCGAACAGTT GGTGAGATAG 

Fig. 2 (cont ' d 17) 
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319 01 CCCCTTAGGT AG AG AT AC T A GTAGAGATTG AGGCTGTCTC TCAAATTAAA 

319 51 TAAATTCCAA TGTGAATATC ACTATTTTGA AGAAATAATA CTAAACAAAC 

3 2 001 AAACAAACAA AACAAAAACA AACAAACAAA AAACTTGTCC CAGGCATTAC 

3 2051 TTTTTTGGGG GCAGCAACTT TGGTAGAATG CAGAACTCAC TTCAACAAAT 

3 2101 TAAAATAAAA TTAACTCTTC TAACTTTTGC CTATTAGAGT CATATGCATG 

3 2151 CAAATATTCA AAACCCATGC AGTCTACAGA TGTGGGCAGT TAATGTTGAT 

3 22 01 AGGTTGAAGG ATGCTACAAT CTGAATCAAA GAAAACATAT TTTCATCATC 

3 2251 ACAGGACAAA TGCTGTAATT AAGGTGTGAT TTTTATAGAA TCCTTTTGAT 

3 23 01 AAAATCTCAA AATTGTTTTA ATTTCTATTT TGCAGGGGTA CTGCTATCAG 

3 23 51 ATCAATTTAA ATCTGAATTA ATC TAATATC ATTTAATAAT CTCAAAATAA 

3 2 401 TTATTCCATC CATAATAAAA AATAAAATAA AAATTTAACT TATGGC CATC 

3 2451 TTTTACTGTG TACTTTTATC TGAGGAAGAG ATAGAATGAT CTACTAATAG 

3 2501 AGGTATAACA CTGTATGTGT ATGAAAAGTT GGCTAATTTT GGTGCTAAGA 

3 2 551 ATTTACTTAC AAAAAGAAAA AGAATATACT TAGTTTGGTG AAACACTGAA 

3 2 601 TAATGGCGAA ACTAGGTCTT TC TC C ATT AT TTTTTTTC TC TCCAATTTTT 

3 2 651 CAGCAATAGC AAATAGCTGG CAATTATTCC ATGTTAATAT TTTGATCCAG 

3 27 01 AAATTTATGT TCCAGTAAAG CGAGCACATC TCCCTCCTTA TTTTTGTAAT 

3 2751 CTAGGCATGA TGTCAAGTGG CAGTTTAACA AAAGAACTGT TTTTCCTTTA 

3 2 801 AAAAAAAAAA AAAAACAAAA GCTGCCAATA TGTATTCCAT TTCCCTATGC 

32851 CTTCTGTGAC CATCCTTCAT TTCCCTTGGC CCTGGCCCAC CACTGTCCTC 

3 2901 C AT TTG TAG T CCATGTTTTC ACCCTCTTTA CATCCTTTCT TGCCCTGTGC 

3 2951 TTTTGAGTTC TCAATTAACT TGGCTGTCTG CTCATTGCTT ATGATTTCCA 

3 3 001 ACTGCATATC TGATAGAAGC ATAATTTTCT CCTCAAAACC CTTTATCTTA 

3 30 51 TTTTTTTTCC CTATGTGATT CAAACAGATG GCGTAAGATC ATCTGGAAGA 

33101 ACTGAGCAAT TATAATTAGA TTCAATCTGT TTG AAATTG T TCATTCTGAA 

33151 TAG T AAC C TC CTCTGAATTG TTTTCCTGTC CTGGCATTGC CTTGCCCTTG 

33201 TAGATGTGCT TAAGTGTCAT AGCTGTGCTG TTTTGCAGAT ATACCCCATC 

3 32 51 ATCTCGGCAG GCCAACCAAG AAGAGGGCAA AG AGTGG TTG CGTTCTCATT 

3 33 01 CTACTGGAGG GCTTCAGGAC ACTGGCAACC AGTCACCTCT GGTTTCCCCT 

3 3351 TCTGCCATGT CATCTTCTGC AGCTGGAAAA TACCACTTTT CTAACTTGGG 

3 34 01 TAAAATATTC TAAAATATTG ATTTTGTTTT GTTTCTTTCA CCACCCACTC 

33 4 51 TCACAGAAAC CCTGGAATCT CTCCATAACA CAACACGTTT TCATTTAAAG 

33501 GGAGGGATAA AAGCACTTTA ACAGTACCTT TCATTTGTGT CATTGTTTAC 

33551 TC TTC AC AG A AAAATCTCCA AACATTATGC TATTTATTGC TCATGACAAA 

3 3 601 TGGTTAACAT AGATTAATAC TGTGGTTGTT TTCTAGTCTA GGCTCCAGAG 

3 3 651 GCTCAGAAAG TTCACTTGAC TTGAAAAAGT CTTACCATTA CTAAGGGTTC 

Fig. 2 (cont'd 18) 
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3 37 01 AAGGCAGTAA CCAGTTCAGA ACATCTGACT TTAATCCCAG GGGCCTTTCC 

337 51 ATTCCATTTA AGAATCCTCT TAAAAAACAG GAAGGCATCT CCTTATTTAT 

33 801 TTGTCTGAAA TATTAAAACA TCCTTAAAAC AAAATTAGTA ATCTTTTGTA 

33 851 GAAAATAGAA ACAATTAGGA AGAAAAAAAT ATGTAATTCC ATGACTCAAA 

339 01 GTTAACTTCT TTTAACACTG TTAAAGTTAA AACTCCTTAA AATTCATACA 

33 951 AGAATTTCTG TTAAGACAAT ACTCTGAACA TTTTCAAATA GATACAATGA 

3 4001 AAAATAAATT ACCAACTTAG TCATTGGGTT ACTTTGTATT TAACATCATT 

3 4051 TGTATGAAAT ATAAAATCAT TTGCATAAAA TTTCATTAAA AGCACTCTGA 

3 4101 GTAACAAAAT AATTAAAGAA AACTAAACAT GCCAGATACC ATTTAATAGA 

3 4151 TTCAATGACT TTAAAAATAT ATTTATTTTC TATAAAGTCA CATATAAAGT 

3 4201 ATTTTCATTA TTTTTATGGT AAATATTTTT ATTATTAGTT TATCAGAAAA 

34251 ACTTGTACAT AAAGATGAGT ATTGATACAT AATCTTATTA GAG CC AG AG A 

3 4301 CGATCATTCC TTCTAGAAAA ACACATCTCT GAATTTAGGA CGGAGGACAA 

3 4351 TGAAACAAGA AATTTCACTT TATAATTTAC CTTTGTCAAA CTATCCCAGA 

3 44 01 GCACATCAAT TCCATCATGA AAGTACTCTT TTGACATTAT ATAAAAAATT 

3 4451 AGTAATAGAA AACACACAAT CCAAAACCTT AT ATTTTC T A AACTTCAAGT 

3 4501 TAATCATCAA CTTCTCTTAG ATTTTTGAAG ACCTGAAAAT AAACATAATT 

3 4551 TCAAATAACA GAACTCAAAC ACCATATACA TTTGTAATGA GGCACAACAG 

3 4601 TCAATTTTGA GCCTTGTATT TTCCAGGTTT TAGCTGAATA ATCTTCACTG 

3 46 51 CTTTCTTAGC TTTTTGCCAG TCTAG TTTGG GGACTATTTT GCCTTACTGG 

3 47 01 GCCTAAACAG AGTGTAATAT TAAAATATGT TAATAAGCCA TACTGAGAAT 

3 4751 AAGATAAATG CAGGTTTCTA ACTCCTTAGG GACACAAGTG GGGACAACAC 

3 4 801 ATTCCATGAA CACAGGTGAA TGAATGCCCC TAGTTTCTCT GAGTTGGACA 

3 48 51 ATTTCATGCG ATCATTTTTT TCTCTGAGGC CAAAGTCTCT GGTTTGATCT 

3 49 01 TCCTAGCAGC TTCCAGAACA GAAAGTGAGT TTACTTTGTC TCCATATTCT 

3 4951 TTTTCTCCAT GCTCGGGAAT CCCCTGCTTT CCTGATCCCA C C AC AAAAAC 

3 50 01 TCCCCTGAGG ATGAAGCCTT GGCTTTCCAG GCTTCCAGGG AAGCCTCGAT 

3 5051 TCCTGGCTGG AGGTAGTTGT ACCACACTCC CAGAGGGCTA AATCCCATAA 

3 5101 ACATCATCTT CTGTCTTTGT AGATCATAGA ACTTTTTATT ATCATCCAGG 

3 5151 AAGATTTCTC TTTTGAAACA AGGCTGGAAA AACTTTATGT CAGTCCTGAC 

3 5201 CTGCTCTTTA ATGACTGCGT AGAGGGAGAT GCCCAGCTTA TCCAACTTGG 

3 52 51 GTTGCAGAGA GGACAGATCT GCAGCCCCTC TTGCCAGAGA AAACATCCTG 

3 5301 GCACAGCCAC AATCACAACT CCATTCTTCT CCCGATAGCT CCTTTGCTTT 

3 53 51 GAAACTCATT GGTTACTTCT CCAGTGTTTT CAGGTCTATA TTCTCCAGGT 

3 5401 ACTCCAGCAC CTCTTTCCAG GGCTTGGACA AAAATACATC TGTGTTGGCC 

Fig. 2 (cont'd 19) 
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37251 


CACAACTTTA 


ATTCTAATCT 


CTCAAATCAA 


CATTTCACTT 


TCTGTCTGAG 
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GCTCTAAAAC 


T AAAATCC C A 
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TTTTCAATAG 
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ACAGGAACCT 


GTGATTTTAT 


CTTCAGAATA 


TCTCCTACAT 


CTGTCTCTCA 
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TTTTCAGGGA 
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ATAATATTTA 
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ATTGTATTTA 
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GAACCTCTGT 


TCATTGGGCA 


AGAAAATGAG 


TACTCTTAAA 


ATGCAATAAT 


37901 


AAATTAAAGT 


TACTTTATTA 


TTAAATTTTA 


AATATATATA 


TATATACTTA 


37951 


CCTTAAXTAT 


GTCCTCTTGT 


TGTCTTTTAG 


CATCACCCAT 


TTTTGATTTG 


38001 


ACCATTATCT 


TTTCTGAATA 


ATCAGTAAGA 


TACAGGATTA 


TTATTAATGT 


38051 


TCAAAAGTTG 


CAGTATTCAT 


GTTTTCTTTA 


TTCTTTCTAC 


CAATTAAAAT 


38101 


GTGTTAATAT 


ATAAAATTTT 


TAGAAATTTT 


AC T AT AAAAA 


ATCACAACAT 


38151 


ATATTAGAAA 


ATTAAGATCA 


C T AC AATATG 


TCATATTTAG 


TAG ACT AC TG 


38201 


TG AG CT AC TG 


C C AC AG T AAA 


CTATGGTTCG 


TGTGTCGTTC 


CCAGCATGCT 


38251 


AGCCCTAGTA 


GAAACCATTC 


CCATTCAAGA 


AAGACTAACA 


AAG T A TAG C T 


38301 


TACATAAATC 


AAAAAGTCTT 


TGGATGAAAC 


TTCATTTGGG 


AAAATAACCC 


38351 


AATCGCTACC 


CTTCAATTTT 


TTATGAATGA 


AAAAATGGAA 


GAATAAAGGC 


38401 


CTCTAAGATC 


CATTCAAAGC 


CAGGAGACAC 


ACAAGAATTT 


CTAAATAGAA 


38451 


GAGAAACAGA 


AGAGGTCATA 


GTTCTTGTGA 


GCCATCTCAT 


AAC C TG G TG A 


38501 


GACTCATTGT 


CATGCCTCCA 


TGCATGATAA 


CAATCGCTCA 


GATTCATTTT 


38551 


TCATCTTGCC 


ACAAGGGTTA 


CATGCAGGAA 


CATTAATGTC 


AACCTGTCAC 


38601 


TTCTAATATC 


CATCTAATAT 


TCTCTAAATT 


CGATGGATCC 


TTTTGCATAT 


38651 


GGTGATTGTT 


AAACACCTTT 


GCATAGGAAC 


AGTTTCTATG 


CTTTTGTACT 


38701 


CAAATCTTCC 


TCTACCTTGA 


ATCCTTTCCC 


ATCTTCGTGT 


TCAACCTTCA 


38751 


ATCTTCTCAG 


AATGAACTCC 


TGTCTTCTAT 


TCTTTCGGAA 


GCATAGAATC 


38801 


TCACGGTCAG 


AAGAGACCAC 


ATCTGGTTCA 


ACCCTTCATC 


TCTTATGTAA 


38851 


AATTTTATGA 


CATCTCTAGC 


TTCTTCTTTA 


AACCCACCAA 


TGACAGAAAC 


38901 


TACTAAAATC 


TAGAAATAAC 


ACCTTTGAAA 


TTCTTTCTTT 


AAG AG ATC AA 


38951 


ATAAAATTTT 


' CCTGAATCTT 


CACCTATTGT 


TCCTAGTTAT 


AT AT ATC C AG 
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3 9001 ATTC TAC AAA ATAAGTCAAA GTTAGATTGC ATATGACAGC TCTTCATATT 

3 9051 TAAAACAATA TAATAAACTC ACTAGTTAAT GTCTAGCTGT AGATGCAAAA 

39101 GTAGAGAGTG ACTTGGGGTT ATTTAAAAAC CCAGTCCAGC CAGACACATT 

3 9151 GGATCATGCC TGTAATACCA GCAGCACTCA GGAGGCTGGG GCAAGAGGAT 

3 92 01 CCCTTGTCCA GG AG TT AC AG GCTACAGTGA GCTATGATCG TGGCACTGCA 

3 9251 TACTCCAGCC TGGAAGACAG AGTGAGACCC TGTCTCACAA TAATAGTATT 

3 93 01 TAATAATATC ATAAAAACCC AGTCCACATT TATATAGGAT CCTGTTTTCC 

3 9351 TCAAGTTACT ACAAATAAAT ATATAATCTT AATAAAAGGT TAGTGGCTTT 

3 9401 GCCAAGATAG TGGCTTGGCT ATGCAAATGC AATTTAAGAC AAAGTTGGTA 

3 9451 GCCCTCTTTT TCCTAATACA TTGCCATATC TGTTTCTCTT CTATTTGGAA 

3 9501 ATTCTTGTGT GTCTCTTGGC TTCGAATGGA TCTTATAGTC CTTTTATTCT 

3 9551 TCCATTTTTT AGTCATAAAA AAACTGAAGG G TAG TG ATTG GGTTATTTGC 

3 9 601 CCAAAGCAGA TGGAAAGCAA AACTACCACT AGAAGCTCTT TAC C AATTTG 

3 9651 TGTTCCATTC AAAAAATTAT CTTTGTATGT CTTACATTTG TCTTCTACTG 

3 9701 TATAGTTTTT CTTGTTCTAT TTTACATATT AACTTTTCTC CTTCTTCAGA 

3 9751 CATCTGCCCT ACTGGCTACT CTTGAAATCA GAG AC TGTGT CATATTTTTC 

3 9801 CTTCTATTCA ACTACAACAT C T AAAAGC AG ATCTGTCATA GTTATTAACT 

3 9 851 TAATTGAACA CTCTTAAATA GTTAGGTGTA ATTTCCAATG CAGAAGCTAT 

3 99 01 CAAAAGGGTT TGTAAATGCA AAC T ATTC C C TTTAAAATCT ATCCTAATCC 

3 9951 TCATTAATGT TTCATCTTGA TAG AG C TAAG TATTATGTAT TGAAATTGTA 

4 0001 GAAGTACACT TCACTTGGAT ATCTCTGCAA TCATTTAGGT AAGAATTATA 
4 0051 CAAAGCCAAA AAGCAAATAA AATATCCTCC TAACCCTATA GATACGTATA 
40101 CTAAAATGAT GCACTTGCAA ATTTGTTTAA TACTTCATTA ATTTAAACAA 
40151 GAGTAAATTC ATACTGTGAA CCAAGAATAG GG TG AC TT AC CCCAATCTTG 
4 0201 CCACCTTAAA CATAAACATT TTAAGTCTTC AATGTCCTAC AG TG T AC C T A 
40251 CTGGCTGTTG TCACTAATCA GACCGAAATG GTACTAATGG TCACTGCAGG 
40301 - CTGAAGGAAT ATGCTTGAAA GATAGGCAGA TCCTCTCCCT CTCCCTTTTT 
4 0351 TACTTTTTTC GCCTTTCCAT CCTTTCTTCT TTTTTTCCAA TAGATTGTGC 
40401 AC TTTGG AG A TTCATATTTT CTTCCTTTTC CATTACATTT TAAATATGTG 

404 51 ATTCTTAGTC CTATGCTTCC TTTTACTCCA ATCAATAACT GGCTCTATCA 
40 5 01 GAGGGTTGTT CTGTGTGTTA ATTC G G TT AA TACCAGGATT ATCAAGCACA 

405 51 GTGCCTTCCA AATG TG AG AT ACTTCTCTCC GGTTACCTCT GGGTTTACTT 
40601 TTCCTGTTTT ACATTGTTTT GAGAGCCAGT ACTTGTATTA AGAAGAAGTT 
40 6 51 TAGTGCCTGT GTCACAGAAA AAATC TT AG T AAATTTTGAA GTGATGTCAG 
407 01 AACAACTCTA AGCCACTGAC GGATTCCACA GGGTTTTGAA AATACTCGTT 
407 51 AGTTCCCTTT ATATCTTAAG AGGCTCCTGC CTGCTTTCTC AT AT AC C AG T 

Fig. 2 (cont'd 22) 
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ATATGAGCAT TTAGAATATC TTTCTCAATT 
CAAATTTCAC AACTATATTG TTTTCCAATG 
CAAATCTTTC CTTAAATTGA TG AC T AC C AG 
AGCAATAAGA AAATAAATTG TTATTAAAAA 
TTTGGAAATA TAACATGTTT GTGACTTTTG 
TCATCTTAAA C AG AG T AG AA AATCATTTCA 
GCTGCAGATA CCATGTATGT T AC AT TG TG T 
TATACACTAT TTCC TTATAA TCTAACTTTG 
CATGAACTTA CATCAATTAA GAGCTTGAAG 
GAAGCATTTA AAGCCCTTCT TGGGAAATTT 
TCTTTTTGAT TTTGCTTTTT CCATTAAAGT 
AAACCGAAAA C TC TAG AAAG ACCATCTTTT 
AACACCATGT TATTACATTT CTAGCAAGAG 
TTTTGTGTAC TGTTGCTTTA GAAATTGATG 
CCAGAGGAAA AGAGGGACGC AATGGGATCT 
CAGACTCTTA CCTTAAATAG GGACTCACTA 
TATTGGGATC ATGTTCTTAC TTTCTATCAA 
AG C TTAAG AG TAATAGGCTC TGTGTGTTTT 
AATTTGTCTC AATTTAACCT TCCCGGGCCC 
CATCCCAGCC CAAGACTCTT CCTTCGATCT 
GTGGGAGTGC CACTTCTCTG GAG G AAAG AC 
GGCTCATTCA GAGACAGCAT GGAAGAAGGT 
GATGAAGTCA CTTTATTTAA ACCCTGAGAG 
CATCACAAAG ATTCCTGAAG AGGAAAATAA 
GGAAACTAGA AGCTTGAAGA AGTTTTATTC 
TATGTATTTG GAAATATGCC AG AATTTG TT 
GAAGAGTTTA G AC TAAATC T ACTTTTCCAA 
CTATTTTCCC AGGTGCATCA AATATCAGAG 
TTGGTTAAAG AGCCATAAAG ACACACAAAC 
AAT AC C AC AT GTTGCTGACT TTTATTCCCA 
TTTTTTATTA TATCATTTTA GAATTCATAT 
AT AAG TC AC T ATTTCTTCAT CCAATGGCAA 
ATAAATATCT TTATAGTTAT ATAGTTCTAT 
AGCTTGAAAA CC TC AGG AAG CTGAGTTGAT 
GTAAACTGTA GAAGCTCAAA TGTCAAATTT 

Fig. 2 (cont'd 23) 
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42551 AACAATAATT TG AG AG AC TT TTCTCTTTGA TTTAATGAAT TTTTTTAGTA 

42 601 TC C AT AAAG A AAAC TT AC AG CATACATATT ATAAAGCATG TCAGCTAAGG 

42651 ATAAAATAAA AC TAG AC AT A CAAATTCAAA CTGATTAGAA TGAAATTATT 

427 01 AACCCTAATA ATTATG TTT A AAAGAAAAGT CTCCAAATCT TGAGACATAC 

427 51 CAGAGTTTAA GTCTTCAGCC ATCCATTTAC TTGTGGTATA AACTTAGGCA 

42 801 AGTTTCTTAA CCTTCTTATC CCTAAGTTCT GCATCTGTAA CTTCTTAGGT 

42 8 51 TTGTCACAAG GATGAAATAT GAGAACAAAG AATAATTCTG TTCCATGATC 

42901 TTTTCCCTTC CTACCTTCTT ATTTAAAGTA TCTTCTGACT GAGGGGTTAG 

42 951 GCAGCAATGA AAATTGACTC ATG TTTTTC A GGTCACCACT ATGGATTCAA 

43 001 TATACTGGCA TTAAATC AG T AGAGAATAGT TGTCATTGCC TTTTGCAATA 
43 051 TTAACCAAAC CACTCAGTTC ACTGTGACAG ACAGTGAATT ATATCCAATG 
43101 AC TC C AC TG A TTTTTTCCAT G TAG AT AG AC AAAATATAAC TACTCTCAAA 
43151 TGTAAGGACC CTGCTTTCTG AAATGGTTCT GTTGCTCTCT TCACAGATAG 
43 201 GCTTCTTATA ATACTTTTAA AATAATTTGC T AAG C AT AC A GATGGCTTTC 
43 2 51 TAGAGTGTGG CATTGACAAA T AAAG TG ATT TTT AT AT ACT GGGAAATTCT 
43 3 01 GGCCTTCAAT GTATCAGGAT TAAATAATCT GAATTTCTGA AAGCTAGCCT 
433 51 AAGTGGGCAA GATGGCTTTT TTG TGCTCAC GCATTGAATA CTGAACTATT 
43 4 01 CTAGTTCTTA AATGGCGATC TAGATTCAAG ACTTATTGAA CTAGATTGAA 
43451 GGG AC TTT AT TGATATCCTA CCTAATGCTC ACACTGACAG ATGAAGAGAC 
43 5 01 TGAGCCACAT GTTCTAAGGT CATAAACAGA AAGAATGAGA ATGAGATGGT 
43 551 CTAATTAATT GTCCACCTTT CCTATGGTAC ATCAGGGTAA CACTTTAGTT 
43 601 TACGAGGGTA TTATTAGAGA TAGAAAGAAT TTTTTTTTAA AT AATTG AC T 
43 651 CAAATACCAA C ATTTTG C AC ATTACATAGA GTAATAGCTT TGCCCAAGTT 
437 01 AG AAAAC TG G GGGTTCTTCT TTATTCCTCT TTTGACCACA TC TAT AT AC T 
437 51 C AG TTT T AAA AAGGTTCTTC CTGGTATCCT TCAATTCCAT CCCCATGTTT 
43 8 01 TCATCTACAA GCCTAGTGCA GCTATTCCAG CCGTCTCCTG ATCAGGTCTT 
43 851 -AAGCACCTCC C AT ATG TC C T TG TAG T ACC C ACCATATTGA TCTCAGTAGC 
439 01 AATCACAGTA CTCTATTGTA AATATCTTTT AAAT T ATT AT CTTCTCTTTG 
43 951 AGCTTTTGGG ATTTTATCTT ATTTATTTTT G T AG TT C C AG GATCTAGCAA 
44001 CAGCTTGTCA CATCGTTCAT ACTCAACTAA TG T TTG TTT A ATGCACAATG 
44051 AG C AG AAAT A AACATACTAC TCCATAGTAA AAAGAGGATG AACTTTTCTG 
4 4101 CAAATATTAA TC AG C ACC AT TTTATCCACC TTTTGGGTTT AGTACATTGG 
44151 AAGTATAGGA G TAT AAAG C A GAATGTCCAA TGTTTACAGT GATATTTTGA 
44201 AATAGATAAA AGCCAGTGCG ACATTTCCAT TCTCAATTTC TC TG AG AC AT 
44251 C AC C TTG AAA AAAAAAAGTA TTTTTC TC TT CCTAAAATTA GTAAAGGAAC 
44301 AGTAATTCCA CATTTATAAG AGTATGATCA ACGCATCACA GATAATGTTG 

Fig. 2 (cont'd 24) 
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GTGCTTATTT TCCTGAAATT ATATGGAGAA 
TTGTTGGATA CAAATGAAAT AAATAAGG T A 
AAAGTTTATG GCAACTTTAG TTTGGGTTTC 
ATGGGAATTT ACTGTAGCTT TCAACATGTA 
CATGCTTGTA GGCTTCTGTC TAATAACTTG 
AGTATGGATG GGGCTCTTCC ATGTCTCAAC 
ATAGCAGGCT GGTTCTCAGA ATCTTATAGT 
CCCTAACCAC CCGTTCTTTA CTTTTTCTGT 
AGACCTTATT GTTTTAACTC TATTGTTCTG 
GCTCTTCATT ATCACTGGTG TCCAGCACTT 
AGTAATGGCT GTTAAGAAAA AGCTTGTGCT 
TGAAATAGAT CATTTTACTG TGAACAGATC 
CAGGAGTTGT GTAGCAAAAT AACGGCATAC 
ATAAAGTGCC AGGTGCTCCA CCTGCCATTC 
TTCACTTGGC TTTTGTGCAC TCATCATAAT 
C G ATTTC AT A AAACCTACTG AGGTATGACT 
ATGTAGTCTG CTATACTATC ATTTTAGTAA 
GTTCTTTGAG CCGTGACTGT TAATTGTTCT 
TTTATTTTTA AGTTTATTGT TGAGAGGATT 
AGGGTAAAGG G TAAAAG C AT AAAAGAACCA 
TACCTTTTGA AAGAGTGTGA TTTTTTTAAC 
ATTTATTTAT TTATTTATTT TTG AG TCG AG 
CTGGAGTACA ATGACACAAT CATAGCTCAC 
CTCAAGTTAT CCTTCTGCCT CAGCCTGTCA 
CGCACCACCA TGCCCAGCTA ATTTTTAAAT 
TGCTATATTG AC C AG AC TG A TCAATACCCA 
TGCTTTAGCC TCCCCAAGTG C TG GG ATT AC 
AGATAGAAAC TTTATTTATT AAGAGAAAAA 
TTGCAAACGT GTGACATTAT AATTCATTTT 
TTTGGTAAAT ACAATTCTAT CTTTTTTAAA 
ATATTTTATG AGTGAGTCTT TTTAGAGCTA 
TAAAAGCAGC CTACTGAATC TCAATGGGAC 
TTAATCCTGT ACTGTAAGGT TCATGAAGAG 
TGATCAACTG AGCATCATGG GCAG TATTTT 
TTCAAATGTT TGTTTGAACC TTCTCTTTAT 

Fig. 2 (cont'd 25) 
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46101 AGATTAATCT CATATATTTA CTGCCTTACA TAGTCATTCA AAATCTGACT 

4 6151 GTTATTGGCA GAAGTAATAT TTTTCTAATC TCTCCTTTCA ATGATTAAAA 

462 01 TTACCCATAG CTTCTAGAAA TTAAGAAATC ACGATTAGTT TTTAGGTAAA 

462 51 TGTACTTTTT GTGCAAATGG ATAAAGTGAG GAATGTGTAA ACACACATGA 

463 01 AAAAAACACA TAAAAGAAAT ATATTAAGAC TTAGTGTTCC TCCTGTTGGG 
463 51 CCAGCACTGC CATTTGTTGG GGAATTGTAT TCTGATTTAA ACCATTGCCA 
46401 TTTACATCTA TGTGTAACAT CAAAAGATGT AGCATCATTA TTATTC T AAA 
46451 TACATACAAT AATTAATATT TGGATAAAGC TACCTTCATG AAACCTAAGA 
46501 AAAACTAAAT TAAAAAGAAA GAAAGAAAGA AAAATACACT TAGATAGAAG 

465 51 AAATAAGGTC TAGTGATTGG TAGCACAATA GAGTGACTAT AGTTAACAAT 
46601 AATTTATTGT ACATTTCAAA ATAGCTAGAA AAGAAGATTT GGAATGTTCC 

466 51 TAACAGGAAG AAATGATATT CTTCCTAAAT GAAGAATGGG ATATTCCACT 

467 01 TTCCCAGATT TGATCGTTAC ACAGCATATG TTTGTATAAT ACCACATGCA 
46751 C C CC AT AAAT AC AT AC AAC T ATTGTGTATC CCAATATTAA AGATTTTTTT 
46 801 GAAAAATTTA TTCCTCAAGA AAAGGATCAT GAGTTTAAGA AAAAACAGAT 

468 51 TACTAGTCTA CCAGTGTCCA GTAGACCTTT CTGTGTTAAT AAAAGTGTTC 

469 01 TGTATCTACA CTATCTAATA TAGTAACTAT GAACCATATG TTGCCATTGA 

46 9 51 TTATTTGAAG TATATCTGGC AAAGAGATGA ATTGACTTTT TTATTTTAAT 

47 001 TAATTTACAT TGAAATAGCC ACATGTGCCT AGCAGCTACT AGATTGGATA 
47 051 GTGCAAGTTT ATAGAGAACA CAAGGGGTAC ATTTG TAG AT AGGAGTGGGA 
47101 TGTCAAAATG ATGAGGATAA T T AG AAAGC A TACATGAGAA ATATTGTTTT 
47151 AAGAGTAGAA TATGAAATGG G AAC AC AG AT TAAAATAGAG TATGTATATA 
472 01 TATACATATA TATGTGTATA TATATACATA. TGTATGTGTA TATATATACA 

472 51 TATATATGTG TGTGTGTATA TATATATATT TATAGGCCAA TATATGGAGG 

473 01 TAGGGTATAT CCTAGTGTTA AG TG AG T AAA GAATGGATTA GGTGATCGAG 
47 3 51 CCACATGAGA AGGTGATATT ATTAGAAAAT TGAAAGTTGT ATTTG AG ATG 
47 4 01 - ATGAAAATGA TATATTTGAA TTGAAAAGTA AACTGTAGTA AAATAATTCA 
47 4 51 AATAAATGAA TATTTGGGGA ACTACTTAAG AG AAAAAT C A TAAAACATGA 
47 5 01 GGAGTCATTC TTTCCCCAGT CCGCCATGAT CAGGCCTTAG GATTTAATTG 
47 551 GCAATGAGAA AATACCTATG AAAATGCTTT TTAAACTATC ACATGAAAAA 
47 6 01 GCAATTTATT ATTTTTCATG CCTTCTTAAT AACTCTCAAT AGAGATTTAG 
47 651 TTGATTTGCA TTTTTGCCTG GTTCAATCAA GAAATTATCG CGTGACATCA 
477 01 GGCAAGTTGC CAAATTTCTT TGGACTATAC C TAT AAAAT A AAAT TTG AAA 
477 51 ATATTAGCTA GATCTAACCC ATTTGTCTCC GGATGTCTGC AAAGTGGTTG 
47 8 01 GAAATCACAA GCCTAACCTG ATCTGCAGAG GTGTTACCTT TGGCAAACTT 
47 8 51 ATGGTTTTTG TGTTTGTTTT GAAATCTAAG GCCAAGCGCG GTGGCTCATG 
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CCGGTAATCT CAACACTTTG GGAGGCTGAG GCGGGTGGAT CACTTGAGGT 

CAGGAGTTCG AG AC C AG C C T GGCCAACATG GCAAAACCCC GTCTCTACTA 

AAAATACAGA AATTAGCCCG GTGTAGTGGC ATACGTCTGT AATCCCAGCT 

ATTTGGGAGG CTGAGGCAGG AGAATCGCCT GAACCTGGGA GGC TGAGGCT 

GCTGCAGTGA GCGCCACTGC ACTCCAGCCT GGGCGACAAA GCCAAACACT 

GTCTCAGAAA AAAAAAAAAA AAAAGGAAAA GAGGGAGAGG GGAGGGAGAG 

GGAGAGGGAA TCTAAGCCAA CACTGTGAAA TATTGTGAAA TATGGAGCTT 

CTACCTAAAA ATTCAAAATT TTAAATTCCT TTTAAAAATA ATTGGAATAT 

CTATGGAATA TCTAGCAATA CTAAGATGAA ATTCCTCTGG GTTTTCAGTC 

ACCTGTAATT GACACCTTTA GATGTTGGCA TGGGCTCTCA GGAAGCCACA 

GCCTCCACCA ATGCTTTTCT TCCTGACACT GAAGCTAAAT TTGGGTGGCT 

AGTTTTCATT GTGCTGTTGC TTTCCTCATG GGAAAGAAAT ACCCTTTGCT 

ATTTATATTG CTGTCAAATG GGAAAATGAA AG AC AG CC AA GGAAGATCAT 

GTGACTATTT AAATACTTCA AGTCCATTTA TTCTTTATTA GCCTTGTCCT 

GTTAGGCATT TAAATTTTTG ATCCCTGCAA TAGATGTTTT TTGATTAACT 

G TAT ATT AAA AACTATATTT AACCTGTTTT GAATTTGAAT TCTAAATTGT 

ATTTTTTCAT GAGAGCAAGT GTCATTTTTG ATTCATTGTG GATTGTTTAA 

CATGTTGCCT AACAAATAGC TAATACTAAC GTCATAACTT TTTAATTAGT 

AAATTTGAAT GGATAAATGG CCACTTATTG G C TT AT AG AA TAAATAAAAA 

CATTTTTATT CAGTCAAGTG TTTCATATTT TTTATCATCT CCAGGACATT 

GGGCTTGCTC AAAACCATTG TTAAAAAAAA AATGGCAAAT AATCCAGTTC 

CATCATGATA TCATTAATCC CACACCTAAG CTACTGAAAA AAATATATTA 

ATATTCTGGC TCATTGCTTT ATTTTTATGG TAACACCCAC CTGGTATTAA 

TAACCACAGA GTACGAAAGA AGGCAAAGGT TAAAGCAAAT AATAGTTTTG 

AAAAATTGGT AGTGAAAAAA GTCATGCTAT ACGGTATGTA TATAATAGAT 

ATTTAATGAT TATGCTTGCT ACT AG TAT AT GTAACAGGAC TATTATAGAT 

TAACAAAAAT GCGGTGAGTA TATTTC TTG A TTATTTTTTA AAAGAATAAA 

TTATTATTTA AAAATACATG AATTATTTAT TGATTCTTGA ATCTTTACCA 

GCTTTCTATA ATTCTAGGAA GCCTAGAAGC AGAATTGGGC AGGATAAACT 

GGCAAAAAAT G TAAAAAG T A GGCCGGGCAC GGTGGGC TAC AGTGAGTCGT 

GAATGCGCAG TGC AC CTG AG TGATAGATCA AGATCCTGTC TCAAAAAAAA 

AAAAAAAAAA AAAAGAAAGA AAGAAAGAAA AACAACAACA AAAACAAAAG 



CAAAGTACTA GGGAAAACTA ATAGACATAG TTACATAGTT AATTGTGCCA 
TATGTTTTAA GGCAATGAAA CTTTTATCTT AATATTCCTT GCTTACTTTT 
TATTCAAAAA CCAAACTGTG TATAAAACCT TAAAATTATT AGG ATC T AAA 
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TTAAAAATAA 


AATGAAAGGA 


AGTTTGTATA 
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GCAAGCTGTC 


C TTTC AC ATT 


CTAGATTGAG 


TCTTAGCTCA 


ACACCTAATA 


51151 


AGTTTTCT AT 


AATAGTAAGC 


ACTC ATTAAG 


TCATTGATAA 


ATGAAGGTCT 


51201 


ATGGTCTTCC 


TATTTTATTA 


CAGTCTTTTT 


CCCACTCCCT 


GTAAGACCAT 


51251 


CTACACAGGA 


TAATGGTTGA 


AACTTGGGCA 


CCAAGCCTCC 


ACAACACAGG 


51301 


ATACTAGCAT 


CTCAGACTAT 


CTGTTTTGTG 


TCATTATCTT 


GTTGCCTCTA 


51351 


ACTGCCATTT 


TATGTGTGGT 


GTGTCACCTA 


TTGTTCTAAT 


CACATATTTC 


51401 


ACAAATACAT 


ATTTGGTTGC 


ACTCGTGAGC 


AAATCAAACT 


GCATTCAGGA 
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51451 


AAGAATACTA 


TTTTAATTTC 


51501 


GAGCAGGAGG 


ACTTTAATTA 


51551 


TTGATTGGTT 


TACACTGAGG 


51601 


AACAACAGCT 


TTTGCAAAAG 


51651 


AGCCTTTAAA 


AATAAAATTT 


51701 


TGATTAAACC 


AATTTGTCTT 


51751 


TTTTTAAAAA 


TCAAAGTGGT 


51801 


TTCTAACACT 


GATCATAGTA 


51851 


ATTTGCCTTA 


G TAG TAG TT A 


51901 


TCATGAGTTA 


TGGTTTAAAA 


51951 


GATTATATTT 


TAAAGATACT 


52001 


AGAAATGCTC 


TTTGAGTTTG 


52051 


TTTTCTGAAA 


TCATTCATAG 


52101 


ATTTT AG AC T 


GGGGTGTATC 


52151 


AGTAGAA^CA 


CATTTGGTGA 


52201 


AGAGTTCTCT 


ACTTTGTTAG 


52251 


ACAAAATCTA 


ATGCCCGGGG 


52301 


AGAATTCTCT 


TTCAAACTGA 


52351 


TAGATAAGAT 


TTG TTTTTGA 


52401 


TAACATCCAA 


GTAACTGAAA 


52451 


CCTAGTGTCA 


GATTGTTATA 


52501 


GACTTGCAGA 


ACAAACTGCA 


52551 


ATATAATATA 


TCCTTACCCT 


52601 


TTAACTAAAA 


AGAAAGATAC 


52651 


ATTAGTTGCA 


CCTATCATTG 


52701 


AAGCTATAAA 


GCATCTCTGA 


52751 


GATAATACAG 


ATTTGGGAAG 


52801 


ATAACAATTT 


AAATGTGAAC 


52851 


AGTGATTGCA 


GGATATTCCA 


52901 


AATTAAAATG 


TACAGAGGAA 


52951 


ATGG AG TAT C 


TTT TACT AAA 


53001 


AG TTTTT AT A 


AAGAAAATAT 


53051 


TAGAAGGAGA 


AGAATCAATA 


53101 


CATTTGAGGG 


ATGGCAAATG 


53151 


TAGAGTTTGA 


CACATAACAC 



CCTTGGTAAA ACATTTGTCC TGGTCAAAGA 
TG AC TTT ATT CAAGGTGAGG TAATGGCTGT 
CAATCAGACA ACAGAGAAAA AAAATGCCTT 
TATTCCTTTC CTTTGAAGTC TT ATTTT ATT 
GTGCTATGTT TAAAAATATT TGAAAATTAT 
TATAATCTCT GAACCAAAGA GTGGATATGA 
TTTATTTACA TCACATGGAC ATGACAAAGC 
TAGCTACTGA AGCATCGAAA TGCTACATCT 
TTCAACTCCC CTTTTATCAT TGATGCTGTA 
AAACAATTTC AATC AC TTT A CAGTTTCCTG 
GGAATCATGT AATAG AG A CT ATTTAATTTG 
GATTCATTTA TGAATAAAAT AGACGCTGTA 
TCATTATCTT AT AAATG T AA AGCAAATGTT 
TGTTCCGGAA AAAAAAAAAA ACAGGAACGA 
AATTATATAA GTGTCTACTG TTTCCAGCTT 
AGTGTTTGAG TTGACCACCA TTTATTTTCA 
CAAAAACTAG ACAGTTAATA AACTATGTCA 
GACAGCATTC CAAAAGTTCA ACTACAACTA 
AGAAATGAGA AGCATCAAAA GTAGAATGTT 
TCC CTTGAG A CTAGATATAT ACTTATAGAA 
AATGTTCTAT CCTTATTAGT CACAACATGA 
GAAAGTGCTT GAATTAAAAC TTTAAACATG 
TTTCTGTTTC AGTTTTATTG GAGTGTGAAC 
CTTAGAATAT ACATTATATT GGTTTATCTA 
GTTTTTTCCC CTGATTTTTA AG ATG TG GAT 
GCTAATAATA ACTCACTAAA TAAAGGTCTT 
GCTTCTCTGC AGTCATTGAA ACTCCAGCCA 
TGATTAAATG TTGAATTAAG CCCAAGTTTT 
TAGCCTTTGA GAAGTTTTCA AACTATGAGA 
AAAAAAACCT AAGATTTTCT GAAAAAGAAC 
AAAGAACAAG AAAAATATGT GTGTATATAC 
TTTTCTACAG TTTTATTACC ACAGTTTTTC 
CAGAGGGTAA ACTGCTCTTG AGTCATTTGC 
GAGCAAGTGA GCGTACTTTG ATTTG TAG AT 
TTTGCTTTTG AATGACATTT GCTTGTTACT 

Fig . 2 (cont ' d 29) 
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53 2 01 GTGGAGTCAG TGTTCATATC CTTTATTTTC AGGAGTTGCT GCTGATACAA 

532 51 TGGGGTTAGA ATGAGCTAAA TACAGCATTT GCTTTCTTGG TTTGAATTCT 

533 01 GGGTTTTAAG TAAAAATCTA CTTGCCTATT CCATTGATTT TTTTAATTGC 
533 51 ATTCAGCAAA TCCATAAACT GCGGAGAGAG CTGGTTGCAT CACAAGAAAA 
53401 AGTTGCTACC CTCACATCTC AGCTTTCAGC AAATGTAAGT CACTTCATTT 
53 451 TTAAAATATA TTACAACAAA TTTTTATAGA GGAAAATGAA ATCATTTTAG 
53 501 TAACAAACTT ACAAATTTTC AGTGCCTGAT ACAGACTTAG ATTACCAACT 
53 551 AGCAGGACTC ATAAAAAGTT AACATTTTTT GCCTACTCAG TAATAAAATG 
53 601 TAAATCCAAA CTGATGAGAG GCAGCAATAT GGTTAAAATG GCTTGTTGTT 
53 651 TCTAATAAGA TTGGAAACAA TAG T AAC AG C CATATGGGTT ACTTCTTTTC 
53 701 TTGTTTGCTA TTTTTATTAC TCCTCTTGCA TAAGATTCCC TGACAATGTA 
53 751 AGAGGGGTTG TTAGTGTTTG ACTTTGGAAG- ATAAAATATT CCTGTGCCCA 
53 8 01 GCCTCCTTCA TCTCAATGTA TTGAACAATT TGTTAAGCAT CCAGTTAATT 
53 851 CTAAAATATG AAATTAG G TC TAAATAGGGA TAGCTTAGCT GCACTGTGGA 
53 901 TGAGATATGG TTTGCTCAAA AAACCTTGGC AGCCTTCTCA TAGCAATTTA 

53 9 51 AAAGGGTACA CTTTTACTGG C AC C AG AG C A GCCCAGGATG GCAGAAATGA 
54001 TGACAATGAA GACCGTCAAT TAAATTAACA TTTACTGAAT ATCTTCCACT 
54051 GTGTCAGGGA GC AC TC AG AG TAG ATG C AG A ATGATAAAGG AGAAATGTGG 
54101 CACTGTTCCC AGTCCTGAGG AGCAATGGTG TTAAGAACAG CAGTGAGGGG 
54151 TAAGGAAATG CCTGCTATTT TGC CAT ATG T CTTACCTCTC TC AC TC AAC A 
54201 GTCCTTTGCT CAGTTCTGCT GCATAGCTTT GGGCCTGCTC TGTGCCTCCC 
54251 CACCCCTCCC ACTGCTCCTC TACTGAGTTT TTCTATCTCC TAGACAAAGC 

543 01 ATGATATGTC AAGAGTGAGC AGGTGCAGAC CCACAGTGTA AGACTTGAAT 
54351 AAGAGCCATT TTTAAATTTT TTTTAAGCTA TCATTGTGCA ATATAAATTC 

544 01 TAAGTATGTG TATCATTTCA TTCACAATGT ATTCATTTTA GCACTGTATT 

544 51 TGAATTGATT TTATTTTCTG AAATTTGGGA GAATTAATTT TGGATTTATT 
5 45 01 - CTATTTATTT TTAATAGATG GTGTTAGGAG ATTCCTGAAA ATAATAGCAG 

545 51 T TTTT AG AT A ATTGTTTAAG CAATATGAGA AAATAAGGGT ATTATTTAAC 
54601 CTTGTTGTGT TTTTAAAGAG AT AG TC C AG A GGCAACCGTA AATTTTATAA 

54 651 TATAGGCTAC ATG T AT AG AA GTATGAAATA TTGTTGTCTA GGTTCCTGAA 

547 01 TTTGTACCCA GAGGAAGTAG AATAATGTAA ATGTCAGAAC CTCCTGGGTT 
54751 GTGTTTATCT GCAATAAGAA AGGCTCAATG GCAAACCTTA TTTATTAGAT 

548 01 TGTCAGGATA CTTGCAGATG TCTTGAATGA TTACTCAGGG TTTCATTTTA 
54 851 TTTTTAATGT CCCTTGGTTG AGCTCATCAT ATAATTCAGA TATTGGAATA 

549 01 ATAAATGGCT G C TAG AC AT A GTGGAAGATG GGCTGATACT TTCCATTTGA 
549 51 AATGTAATGA TGCTTATTGT CTTCAAAAGA AAAAACTAAA ATGGTATTTC 

Fig. 2 (cont'd 30) 
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55001 


ACATTTTTTT 


GTTTTTGTTT 




TTCTCTGAGA 


ATCTCATTCT 


55051 


TACTCATGAT 


TATTGGTTTC 


TTGTGTACCA 


TTTCAACATT 


TTTCTATTAT 


55101 


ATGCTAATGT 


G T ATATAT AC 


TTAATACACA 


CGTGCAAAAG 


CTTCCACACA 


55151 


C AC AC A C AC A 


CACACACACA 


CACACACACA 


CACACATACA 


CACACATACG 


55201 


GAACCAAATT 


CTAACATAGG 


GGAATAATCT 


TC GG AG TG AA 


CTCTGTGCTG 


55251 


CTGTTTGAAA 


ATGGAGATAT 


AATTT TAG AA 


AGGTTCCTGC 


AGTTGGCTAC 


553 01 


CCACCTCGTC 


TGCTCTAATT 


ATGCTTGTCA 


C AC T ATTTTC 


ACTGATGTGT 


55351 


TTTCATGACT 


TTAGGGCATG 


AATTCTCAGC 


TG GG TG TT AA 


TATGACCAAC 


55401 


AAAGGGTGAA 


AACAGGTTCT 


TGCATTTTTT 


TAAGTACTCT 


TTTTATGTGA 


55451 


AAAGCACAGA 


TATGCAGATA 


ATACATAACT 


GAACATCCAG 


CATATCTGTG 


55501 


GCTTTAAAAT 


ATCACGAAGA 


AGAGCACAAT 


TAGGGAAAAG 


AAAACATCTA 


55551 


TAGTGTTTCC 


CTAGGGGAAC 


AATCATTTAA 


AAAAAAATAA 


AAATAAG G AA 


55601 


CACAGACTAG 


AAGCAGCAGT 


GCCAAATAGA 


TAATTCATGC 


TAGTCTTTGT 


55651 


GTTAATTTAA 


AAAGTGCTAG 


TCTTGGAGAC 


AAACGCCCAA 


ATTGCTCTAG 


55701 


GTTCCACTCA 


GCTGTATGTG 


TTATCATTAG 


TATTAACTTT 


TGCACGCTGA 


55751 


TGGGAGACTG 


ATATATATCC 


TGTTTTATGT 


TCCTTTAAAC 


AATTTATAAT 


55801 


G TAATTTAG A 


AACCTTCTCA 


AATCACATTA 


GATCCACACA 


AAAACCTGTA 


55851 


CATAGCAGCT 


TTATTTTTTA 


AT AG C C AAAG 


AAAGGAAACA 


AC C AAAAAT A 


55901 


TCCCTTAATA 


GGCCAGTTAA 


TAAACAAATT 


CTGATACATC 


TATATCATGG 


55951 


AC TACT AC TC 


AGCAATATAA 


AGAAATGACT 


ATTGATACGT 


GCATCAACTT 


56001 


GGGTGGATCC 


CAGGGGTATT 


ATGCTGAGTG 


AAAAAAGACA 


GTTATAGAAG 


56051 


GTCAAATTTT 


GTATAATTCC 


ATTTATATAA 


CATTCCAGAA 


ATGGCAAAAT 


56101 


TAAAGAAACA 


GAGAACAGAT 


TAGTGATTGC 


TAAGGGCTAA 


GGATGAAGGA 


56151 


GAGAGAGAGG 


TAGTGTG AC T 


ATAGGAAGAG 


GGAGATCTTT 


AGTTTTGTAT 


562 01 


TTTGAATGAG 


ATGGCCATCA 


CATGAATCCA 


CATATGTCAA 


TCTATTAATG 


56251 


TAAATCAATA 


TTGTATTCCT 


GGCTTTGATA 


TATAATATAA 


TTTTATAAGA 


563 01 


- TATATAATC A 


TTGGGGGAAA 


CTGGATGAAG 


GATACAAGGG 


ACCTCCCTGT 


56351 


ACTATCTTTG 


CAACTTCTTG 


TGTATATAAT 


TATAAAATAT 


ATAATGTATT 


56401 


AAAATG T AT A 


AAATAATATT 


TTAAGTATCA 


GATACTGATC 


TTTACTCAGT 


56451 


ATATGAAGTG 


TTCTATCATA 


ACGTAACATG 


CTTTTCCTTT 


ATTTGTGGTA 


56501 


TTTTAGTTTC 


AAACTAAAAT 


AT AAATC AC C 


TAAAGATCTA 


CGACAGTTCT 


56551 


TTTGAAAAAA 


AATCTTGCTT 


TTAATTTCCC 


AGGAGTTTCA 


ACCTTAATCC 


56601 


TCTCTTTAGT 


GTTTCTTTAT 


TTGGTAGTGA 


TAGGGACTAT 


CAAAGCTTCT 


56651 


TACCATCAAA 


TACATTTACT 


G AC T AAAAAT 


AGAAAAATAA 


TTTACATTGT 


56701 


AAAAATGTAC 


AAATTGAATG 


ACAGTCAAAA 


GGTACAGGTA 


ATGAAGATAT 
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AAAAAAGTTT ATTAAAATTC TC TTTT AG AC 
TATATAAATA GATATGTATA T AAATG AC T A 
TTATATTTTA AACATTTTAA TAATATTGTT 
TTGTATATAT CTGAACATTT TTGTTCTTAA 
ATCCAGGAAT TTGAATTACT CTCAAGTCAC 
TGAACTCATT CATTTTCTTT GTGTTTGCTT 
CATGCATCAT AATCAGCCAT CAAATAATTT 
TCAGTTGTAA GAAGTGCTGT GTTTAAATTT 
ATCTGAATTA ATATCTGTTA ATGTATGTAA 
GCATGTACTT AAATTGATTA TAGGG AC T TG 
GATATTTTAA ATATAATCAA GGATTTTTTA 
AAGTAAAAGT AAGTCTTTGT TTACTAGTTT 
TTCTACCTTT TGGTTAAATG TGAGTGCAGC 
ATATTAGAAC TCTAATAGCT ATAATTTATA 
TGGTTCTAAG AAATAATC TG TTGGTTTTAA 
TATTCATCTT TACTCAGTAT GTGACATGTA 
TTCCTTTATT TGGGGTGTTT TTTATATATT 
AAACTTGGCA TAATTACATT TATATGGACT 
TAATTTTTAA ATTTGAAATG CATCTATGTC 
TCCCTTTGCC CAAATGGGGT ATGGTAAGTC 
CACCTCTCAT TTGACTGGCA GAG TAAAGCC 
TTAAGCCTTC CCTCCCTTTT GTAAAGTTGT 
ACCACAGGTA AAGTGTTAAG CTGATTCTAC 
GAGCGGTTAT ATTGGCAGGT CCTATTGCCT 
C AC C AAC AAA CAGAAAACAG AAGCCGCACA 
AATGGTAATA GCAAACAATA AGCCAGATAG 
CCTTAAGGCA GCTGGGTCAG GTGGGATGCT 
TTTTCTTTAC AAATCTCAGC CATTACATAA 
TAGTTATTAC TAACATTTTT AAAGACATTA 
ACAAAAGGTC CTTTTTGCCT TTCAGCAGAT 
TCTTGGGTAT TTTGAGATTG TGTTC T AC TT 
CATAAAATTT G C T AG TT AC T CTTGCTTTTT 
ACAATTGGTG CATATTATTA ATGAGAAACA 
TTTGTAGTGG AAAAGAAATG TGAAACTTTA 
TTCAACTAAC TACTAATTTT AAAACATAAA 
TTATATGGGT AAATATATAT TACATATAAT 

Fig. 2 (cont'd 32) 
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58551 ATATGTTTTA TATTTATACA TAATATACTA TATATTTATA CATGATATAC 

58 601 TAAATATTTT C C CAT AT AAA TAATAAAATG CTCTAGGCAT ATATGTGTGT 
5 8651 GTGTGTATAT ATGTATATAT AT AT AT AC C T TCATAACATA CATATATAAA 
58701 ATACTATATT ATATATACTC TAGGTATACA TATATGCCTA TATATGCACC 
58751 TATATATTTA TATATTACTA TATAATATAT AGTATATATT ACTATATATA 
5 8801 C T AC TAT AT A T T AC TAT AT A AT AT AT AG T A TAT AT AT AG T ATATATTATA 
5 8851 TAGTAATATA TTAC TAT AT A ATATATAAAT ATATGTGTGT ATATATATAT 
5 89 01 ATGCCTAGAG TGTTTTTAAT TTGTCAGTGG GCTGTCTCTG TAATCTATAT 
5 8951 GAAGAAATAA AATG TAG AC G TTATGTATAA TGATATTTCA TCTTGTTGTG 
59001 TGGCATCATA GTAATTCTCT TTACATATCT ATTCAGATTA CTTTTGCACC 

59 051 AGCCTAATAC ATTGTATGAT TCCAAAACCA AAGAGAGTAT GGATTGAAAT 
59101 GATATTCCCT TTACTAATAC TCAGTCTTGT CTATTTTATT ACCTTTATAG 
59151 ACTTCACCTA ACACAAGTCA GGGGATATTT ATCATCATAT TAATACAATT 
59 2 01 TTACTCTGAC CTTAAAATTA TGCAACTGCT AAAGGAAAAA TCAGAACCAA 
59251 ATAAACTGTC ATTAACAACC CCCCTGAAAA TCCATATTTT TTAAAAGTCA 
59301 TTTTATCAAG TC TC TC AG AC AAGATGTGAT ACCCTATAAG TTTAATCAGT 
593 51 TTTACTTTCC ATTTTCTCTT CATTAAGGTG ATAAAGATTA TCATTAGTAG 
59 401 AAAAATTTTC CCTTATTTGC CTCCTTTTCC ATTTACCCTA TTG AG TG AG A 
5 9451 AATTTAGCCT CTCATAACTT C T AAAG TAG C AATGTTAATC TGATAAACTA 
595 01 AACCAAGGTG AGATAAATTT AAGACAATAT TTTTTTTCTT CAACTTTTAA 
59551 GTTCTGGCGT ACATGGGCAG GATATGCAGG T TTG TTAC AT GGGTCAACAT 
59 601 ATGCCATAGT GATTTGCTGC ACAGATCAAC TCATCGCCTA GATATTAAGC 
59 651 CCACCATCCA TTAGCTATTC TTCCTGATTC TCTCCCTCCC CTAACTCCCA 
597 01 CTGACAGGCC CTAGTGTGTG TTGTTCCCCA CCATGTGCCC ACGTGTTCTC 
5 9751 ATCGTTCTAC TCCCACTTAT AAGTGAGAAG AAGTGGTGTT TGGTTTTCTC 
59 801 TTCCTGTGTT AGTTTGCTGA GGATAATGGC TTCCAGCTCC ATCCATGTCC 
59 851 CTCCAAAGGA CATGACCTCA TTCCTTTTTA TAGCTGCATA GTATTCCATG 
59 9 01 GTGTATATGT AC C AC ATTTT CTTTATCCAG TTTATCATTG GCATTTGGGT 
59 951 TGATTTCATG TCTTTGCTAT TGTGACTAGT GCTGCAGTGA ACATAATGCA 
600 01 TGCAGGTATC TTTATAATAG AATTATTTAT ATTCCTTTGG GTATATACCC 
600 51 AGTAATGGGA TTACTGGGTC AATTTCTGCT TCCAGATCTT TGAGGAATCA 
60101 TCACACTGTC TTCCACATTG GTTGAACTAA TTTACTCTCC CACCAACAGT 
60151 GTAAAAGCAT TCCTTTTTCT CTGAAACCTC TGCAGCACCT GTTATTTCTT 
602 01 GACTTTAATA ATCACCATTC TGACTGCTGT GAGATGGTAT CTCATTGTGG 
60251 TTTTGATGTT ACCCTTTTTT TTATATGTTT GTTGGCTGCA TGACTGTCTT 

Fig. 2 (cont'd 33) 
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60301 CTTGTAAGTG TCTATTCATA 

60351 TAATGGGGAA GTTTGTTTTT 

604 01 AC TC TAG AT A TTAGACCTTT 

60451 CCATTCTGCA GATTGTCTGT 

60501 TGAAGGTCTT TAATTAGATC 

60551 TGC TTTTGG A GTTTTTGTCA 

60601 TAATATTGCC CAGATTTTGT 

60651 TTGTAAGTCT TTAATC CATC 

6 07 01 AGTGGTCCAG TTTTAATTTT 

6 0751 CATTTATTAA TTGTTTTTTC 

6 0801 TGTCGAAGAT CAGATGGTTG 

608 51 TACATTTCAT TGAAAACAAC 



60901 


ATAAGTTTTT 


TIG 1 OtA 1 AA 


60951 


TTGTGGTTAC 


mm tv m 7\ t~" (~* T* 7\ 

TTA 1 1 PiSj 


61001 


ACTCTTTACA 




61051 


ATAAAGTATA 




61101 


TTTAAGGATT 


CAT AG T AG AA 


61151 


GAGAATTAAA 


ATGGAAATAT 


61201 


TGC T AAT AAA 


GGGTTTCTGC 


61251 


ACAATTTTAG 


TTTTAGGTTT 


61301 


TATTTCTAAC 


AGGTTAAACT 


61351 


TTTAAATTCA 


CATATATGAA 


61401 


AT AC C AC ATG 


GATGAATCTC 


61451 


AAAGAATCAT 


GCCATTTGAT 


61501 


TCTAAACATA 


GTGCTTTAAA 


61551 


AAAAGCAAGA 


AAATAATTAC 


61601 


.... GAAGGAAGAG 


GGAATAATAA 


61651 


TGAAATGTCC 


TATTTCTTGA 


61701 


TGTGATAAGT 


CAGGGGGCTG 


61751 


GGATTTTTCC 


AC AG TTG AAA 


61801 


TATAATCCCA 


GCCAACACTT 


61851 


GGCTAGGAGT 


TGGAGAGTAA 


61901 


CAGAAAATGA 


AAAAAAAAAA 


61951 


CTATAGTTCT 


TGCTACTTGG 


62001 


AGGAGTTTAA 


GCCTGCAGTG 


62051 


GGGTGGCAGC 


AAGACACTGA 
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TCCTGTCTAT TCATGTCTTT GCCCACTTTT 
T AC TTG C GC A TTTGTTGAAG TTC C TTG TAG 
GTCAAATGGA TAGATTCCAC AAATGTTCTC 
TCACTCTGAT GATAGTTTCT TTTGCTATGC 
C TATTTGTC A ACTTTTGCTT TTGTTGCAAT 
TAAAATCTTT GCCCTTACCT ATGTCTTGAA 
TCTAGGGTTT TTATAGTTTT TGGATTTTAC 
TTGGGTTAAT TTTTGTATAA GGTATAAGGA 
CTGTATATGG CTAGTCAGTT CTACCAGCAC 
AGTTTCCCCA TTGCTTGTTT TTGTCAGGTT 
TAGGTGTTTT TCACTAACAT AATCATAACA 
AC G AC TC AAA ATG TTC TTT A GTAACCAGTT 
TTACAAACTG CCATTCTAAT CATAAACATT 
AAAATGTGAG T AAT AT AG TT TAT AC AG CAT 
CTTTGTCAAA CTTTAATTCA TATTAAATTG 
AAAGG AG AG T AATTTTCTTC AAGTTTCACA 
TGATTAAACC TTACATTTCT CCACTATAAG 
TGAG TAAAAT CTTACATTTC ATTTAGTAAG 
CATAATTTTC CTTATTTTAA AAGAAAACAC 
TAGTAACCAA TTTTATGGGC AT AG TGGGAA 
GAAGTGACCA TCATGGGCAT ATATATATAT 
T AC TAT AC AG TAAAAACTAA CTTATGCTAC 
AAAACCCATG TAAAGCAAAA GAAAACCACA 
T AC ACTTGGG TGGTTTTTAA AACAGGCATA 
GTGTAAGCTT GGGTAGGAAA AACTATAAAG 
CACAGAAGTT ATGTAGAGGT TATCTTTGGG 
GAGAGGGACA AAGAAGAGCT TCTTGGTTCT 
CTTGGCTGGT GAATGCATGA ATGTTCACTA 
TTTTCATTTT GTTCACTTTT AT AT ATGTG T 
AGGTAAAGTT CAGGTGTGGT GGCTCACACC 
TGCGGGGCCA AGGTGGGAAG AATT AC TTG A 
CCCAGGCAAC AGGGTGAGGC ACTGTCTCTA 
AAAAAAGTAG CTGGGCATGT TGGTACATGC 
GAGGCTGAGG CAAGAGGATC AC TTT AG C C C 
AAC TAG G G TT GTGGCACTGC ACTCCAGCCT 
GTAAAAGAAT AAAATAAATA ATT AAAAG TT 

Fig. 2 (cont'd 34) 
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62101 AAAATATAGG AAAAAATGAG CATAGCCTTA TGCTAATTTT TCAGTTACTA 

62151 GG TC TG AT AT CATCACATTC CTTGCTTGTC ATTGAAAATT TTTTAAACTA 

62201 TGATACTTTT TTTTAGTGGT ATTTATCCAA TTAAATCTGC TAACAAATTT 

622 51 GGTGTATAAA TCTCAAGGGT AAGGGTATGT GGAGAGTGGG TGTGTTTGTG 

623 01 TGAGAGAGAG AGAGAGAAGA GGGGGAGGAG AAAAAGAAGG AAGAGGGAAG 
623 51 GAATGGAAAA AGATAATAAA GAGTTGTTCT GATAGATTAA TC TTT AG TAG 
62401 ATGTATTCCC TACAAATTGT TTTTCTCCAT ATTGCAGTGT CAGGTAAAGA 
62 451 AAGGCATCCC AGGATGAATT CAGAGCTAGG AACATGCACC TTTGTATCAT 
62501 AATGCTAATG GAAGGAACAT GTACATTCTA AC TGTTAC C A ATAATGGAAT 
62551 ATATTTCCGT TATTAAGTAA TAAGCTTTAA TTCTTTGTAT TTTTGTGATC 
62601 CATTTGATAG TAGGTGCCTC AGCATTTCCA CTC TGCT AT A AGTACATGGA 
62 651 GATATATTTT ATTTAAGTCA TCTTATTCAT GTCTTTCAAA AAGAAATTCA 
627 01 TTTTTGGCCA AGGATTTCCA AATTTTGCCC CATATATAGG TAT AG TTT AT 
627 51 TAT AG AC TTC GTTTGCAAAA TATTAAATCC TTATATCCTT TTAGGGACAC 
62 801 AATAAAATTT TATAAGTTTG AG ATAATG TA CTTGCAGTTC TACCTCAGGC 
62 851 CGTGGTGAGA GATTGAAGTG CCTCTTCATT TTAACATTTT GGGTTCAAGT 
62901 TGTTGCATAA GGGCATGCAA ATGGAAACTG GCCTATTTTT GAGCTTTAAT 

62 9 51 AAAATC GTC A AATACTTCTT AATCTTAAGA GTTATAGTTA TGTACTACAA 

63 001 TATGTATAAT TCTCTAATAT TTAAAACAAA ACCTGAAAGC CACAAAAGCT 
63 051 TACTGTGAAA TAAAATGTGA TGGAATATTA T TTC T AAC TG GCTTACCTGT 
63101 ATTTCCTTCA TTGAAGGGAA TATGAAGTAG AAAAGCCCTT TTATTGAAAA 
63151 GAGTTTGGAA AG T AAAG AT A ACTCTTTTCA ATTCAATTCT TTG T AAG TAG 
63 201 AAAAAGAGTA AAGATAATGT TTAGCTGTCA GCAGATGTCT G AC AC TTG AT 
63251 GGAGCGTATC ATTACAATAG AGCAGCTAAC AATATCTGCA AAGGTCATCA 
63 3 01 TG AAAG TATA AAAATGAGGA ATATTTGTCC ATTGACCATT TCAGTGACCT 
633 51 CTTTTTGGGC TTTAAGTCTA AAAATCTTGG CAGATCAGAA CTTTATATTC 
63 4 01 GGCATTTTGA GTGTCAAATC TCTACATGAT GTGCAAGTCA GAAGGAGTTA 
63451 TTACTTGCAA AATACCATCT TCTTTCAGAA GTTAAACTCA CATTAAATGC 
63 501 CAGGAGACTG AAACACTGAT TTT AAG AAG A C AAAG TTT AG AAAAGATGAA 
63 551 TGAAAATGTG TGTTAAAGAA GAG T C AC C AG TCAGAGCTAA CTATGATAGT 
63 601 CATAGTATTT AAAGAGTTGG AACACATGAA ATTAAGCATT TTG T AAAATG 
63 651 AAGGCTTTTC ATCCATCCAC ATAAGATTCT GACATTTAAA CTATGTTTCT 
637 01 TCCATTCTGT TCACAGGCTC ACCTTGTAGC AGCTTTTGAA AAG AG C TT AG 
637 51 GGAATATGAC TGGCCGATTG CAAAGTCTAA CTATGACAGC GGAACAAAAG 
63 8 01 GTATGTTCAG AAATTGCCAC TGG AG AC TG A AAGAAGACAG C AAATTGC AT 

Fig. 2 (cont'd 35) 
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63851 


AGGATTCTTA 


AATAATACCT 


GAAGCTCCTT 


AAAAATAATA 


TTCCAGGCTG 


63901 


AGTGCAGAGG 


CTCATGCCTG 


TAATCTCACC 


ACTTTGGGAG 


ACCAAGGTGG 


63951 


GTGGATCACT 


TAAGGTCAGG 


AGTTCGAGAC 


CAGCCTGGCC 


AACGTGGTAA 


64001 


AATCCCATCT 


CTACTAAAAA 


CACACACAAA 


AAATTAGCTG 


GGCATGGTGG 


64051 


CGGGTACCTG 


TAATCCCAGC 


TACGCAGGAG 


GCTGAGGCAG 


GAGAATCACT 


64101 


TGAACCCAGG 


AGGCAGAGGA 


CGCAGTGAGC 


CAAGATCACA 


CCACTGCACT 


64151 


CCAGCCTGGG 


AGACAGAACA 


AAAAAAAGAG 


TAATAATAAT 


AAAATAATAT 


64201 


TCAATTCTAT 


AC TAAATTAA 


AACAATGATA 


ATACCTTTCT 


TTTCAGATTT 


64251 


TAATTTAAAG 


ATTTTATCAG 


TTTACTCCAT 


ATTGGAACAC 


ACAAAGGCAA 


64301 


ACAAAATCCT 


TGCTGGGCAG 


TCTATTAATT 


TACTTCTGGA 


TGGAACTAGT 


64351 


AAAAGAATAC 


TGAATGTTAA 


GAAAGAGAAA 


CAGTCACATA 


AGAGAATATT 


64401 


CTGGGGGCAA 


ACTGTTATGC 


AGTTGACAAG 


AATCACACTT 


TGATAAGAAC 


64451 


TTTCACAAAT 


ACATGGTCAC 


TAAATCCAGC 


TATAGGGCAT 


GGCTGTAGGC 


64501 


TAAGACACAC 


AGGAAGGATG 


CCTGGGACTC 


TGCCAAGTAA 


GGG AC TTC AG 


64551 


GTTACAGCAG 


CTATGAAACA 


AAGGCCAATC 


CTGTGTAATT 


TTGAAATAAC 


64601 


AAGAACTAGT 


TGC CATC TAG 


GGATATCACC 


TTTGAAGAAA 


AGTCATTTGT 


64651 


TATATCAAAA 


TACTTAAAAT 


GAACCTAAAG 


GATTTTATGG 


TATGAAAGAA 


64701 


GG TATA C C AA 


AAAGAAAGGA 


ACGGAGAATT 


TAG TTC AC G A 


AGACAAATGT 


64751 


ATTAAAAAGG 


TCCATACTGC 


ATAGAAAGCC 


TGGTCACCTT 


TCCTGTGATG 


64801 


ACCAGTTAGC 


TTACTTCTCT 


GCTGTTAGTC 


CAGTGGCCTT 


AACTTCCTTG 


64851 


GATAGGTATC 


AG AG AT AGG T 


GAAACCTATA 


GAATTCTATG 


GAGTGTGTGT 


64901 


GTGTGTGTGT 


GTGCGTGCGT 


GTGTGTGTGT 


GTGTGTGTAT 


GAAAACTGTA 


64951 


AATGTGCATA 


AATGATCAGG 


TG TC C AG AG C 


TTTCATCTAA 


TTCTCAAAGA 


65001 


GACCCATTAT 


ATCAGAAGTT 


TTGGGTATTT 


TCAAGAATGC 


GTTCCTCTAT 


65051 


CTATCCATAG 


GAATGGCTTC 


AGTTTTGTCT 


TTAGATTCTG 


TAAGTTATGT 


65101 


GATTAGCTTT 


ACAAAAGTAG 


TATGTATTAC 


CAAATTTTGT 


CACTTTACAA 


65151 


- AAGTTTATTT 


TTAAAACAGA 


ATGAATAGTT 


CAATGAAATC 


AAAAG AG TAA 


65201 


ATCGAATATT 


CTTATAATTG 


CCAAGTATTA 


TTAGCACATT 


GTATTCTCTC 


65251 


TCATATTCTC 


CGTATACCCT 


GCCCGTGAGA 


GAGAATATTA 


TCCATTCCTG 


65301 


GAAAATCTGT 


TCTAGCACAG 


CTAACAAACT 


CCTTTTGAAA 


CATAAATTTT 


653 51 


CCTTTOTTTC 


C-TCCCTCCCT 


CCCTCCTTCC 


CTCCCTTCCT 


TCCTTTTTCC 


65401 


TTTTCTTTCC 


TTCCTTCCTG 


CCTCTTTTCT 


ATCCTTCCTT 


TCTCCTCCCT 


65451 


TACACCCTTT 


CTTCCTTCTT 


TTCCCCCTCT 


GTCTCCCTCT 


CTTTCTTTTT 


65501 


TGCTGCAGCT 


TGTCACTTCA 


CTATGTAATA 


TAAGAACCCA 


GCAAATAGAA 


65551 


TTAGAAGGCT 


TTTTAGAGCA 


GCTGACGGGA 


AAGAATAAAA 


ACACTGGCCC 


65601 


CCAGTATTCT 


TGAATGAGAA 


TTCTGGCTAT 


G TC TGTTAAA 


AGCTGGGTAA 



Fig. 2 (cont'd 36) 
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65651 


TCTTGAGCAA 


GTTTATCTAA 


CCTTTCTTGA 


ACCTCAAATT 


CACCTTCTTA 


65701 


AAAGTGGGGA 


TGATAATGAC 


TACCTTGTAG 


GAT C AC C ATG 


AGGAGTAAAT 


65751 


C AG AT AC TGT 


TATCATGTCA 


CATGCTAGGG 


GCTACCAAAA 


AATATTACCT 


65801 


TCCTTTACAT 


TTCTCTTTTT 


CCCTTGAAAA 


TTATAAGATA 


ACACCAAATT 


65851 


CCTCACTGGG 


CATATACCAA 


GCATATTGTT 


GGAAATGAGT 


GTTAGAATTT 


65901 


AAGTCTCAAT 


ATC TTTAATA 


AG TC AAAATT 


AATAGAATTT 


TTGTCCTCCA 


65951 


CCCAATATTT 


TCTTGAACTC 


TGTTATATCT 


GTAAGTGAAT 


TTTCTCATAG 


66001 


AAACATACAG 


AGAATTTTCT 


CATATACATA 


TAGAAAAAAA 


TGTAGAGGTA 


66051 


TGTTAATGTA 


TAATGCCTAT 


GATTAATGCC 


TGAATATTTA 


AAAATAATTT 


66101 


CTATAACATA 


AGAGATTTTA 


TAATGTGTCT 


ACATAATCCT 


TAAAATAACA 


66151 


TTGCCAAAAT 


TATAAAATTT 


TCTCAGAAGA 


TATCAGAATG 


TCTCATATTG 


66201 


TCCTTATCAC 


TTTTTTAACT 


GAAAATAAAA 


TCACTTCTTT 


TTGAATTGCA 


66251 


AACTGTATAC 


ACACAACAAT 


CATGGTTAAC 


TAG TTTATTA 


ATTTGAGATT 


66301 


ATAACTTGCC 


TATTCTCAAA 


GTGATATTTA 


AAAGCCTATA 


AAATT AT TTG 


66351 


CAATGTG'AAA 


TGGTATAATT 


CAAAGACAGA 


ATCTAATTAA 


AACCAGTAGA 


66401 


ATAATGTATA 


TAACAATATA 


CCTCAGCCTA 


GATAATTACT 


ACTGCAAGGC 


66451 


ACTGAAATGA 


ATTGAATTTC 


AAGGAAGCTA 


TGG TACAAAG 


GGAGATTGTT 


66501 


AGGTGTGTTT 


TATTCTCATT 


TTCTGACCAG 


GAGAGCATAA 


TTTAG AC TG A 


66551 


GGAGAAAACT 


CTTTGGCACT 


AAATTCAAGG 


ACGAATTTAT 


TGCCAAGGTT 


66601 


TTTAAATTGG 


GGTCATGGAA 


TAACAAAAGA 


CAAAATCACT 


GTTCAAATAG 


66651 


ACATTTCTCT 


AAAAGCTAAG 


GGCATAACAT 


TTAATCATAT 


TTCACTAAAG 


66701 


GCATTTCTTC 


AGGGAGCTGA 


GATAAAAGGG 


TATATTGCTC 


TCTGGTGATT 


66751 


CAACAATCCT 


GAGAAAAGGC 


TTGTGAAGTA 


TAG AGC AG AG 


ATTCTTAAAC 


66801 


TCCCTTCCCC 


AAGTTATAAG 


TTTCATTTGT 


CTATATAGTC 


ATTCATCAAG 


66851 


TTTATATTGA 


ATTTGTGCTC 


TTCTAATGAC 


AAAAC AG T AC 


AGACAATATA 


66901 


GATATAGAAT 


GATAGATATA 


GGTCTATATC 


TATAGACATA 


CCTATCTACT 


boy 3 1 


AGAAC I C 1 AA 


AAbLAIAl 1A 


T AC A i G 1 A X G 


T AAT AT TC C T 


C ATGG AG TTT 


67001 


ATATTTCTCA 


TAT AT ATC TC 


AT AT AT ATG T 


ATCTCTTTAT 


CATGGAGTTT 


67051 


ATATTTTAGG 


AGGTCACAGA 


TGATAATAAA 


AATATAATTA 


AAACAGGCCA 


67101 


GGTGTGGTGA 


CTCACACGTG 


TAATCCTAGC 


ACTTTGAAAG 


GCCAAGGCAG 


67151 


GTGGACTCCC 


TGAGATCAGG 


AGTTCAAGAC 


CAGCCTGGCC 


AACATAGTGA 


67201 


AACCCCATCT 


C T AC TAG AAA 


CAAAAATTAG 


CCAGGCCTGG 


TG G TG GG C AC 


67251 


CTGTAGTCCC 


AGCTATTCAG 


GAGGTTGAGG 


CAGGAGAATC 


ACTTGAACCT 


67301 


GGGAGGTGGA 


GGTTGCAGTA 


AGCCGAGGTC 


ATGCCACTGC 


ACTCCAGCCT 


67351 


GGGCAACAGA 


GCAAGACTCT 


GTCTCAAAAA 


AAAATATATA 


TATATAATAT 
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67401 


ATATAATATA 


TATATAAATA 


67451 


ATATTTATAA 


ATATGTAATT 


67501 


GGATTTCAGG 


TGATGATAAG 


67551 


GG AT AC TG AG 


AAGCTGGTTT 


67601 


AGG TG<3 TT AC 


ATGTTCTTGA 


67651 


TAAGACTCAG 


ATTCTAGTCT 


67701 


ACATdGCATT 


AATAGCCTGA 


67751 


GACCAAAATA 


ACTCCCTTGA 


67S01 


CTATTTCCAG 


ATGCCAAATA 


67 851 


AGAACACTGG 


TTTTATATAC 


67 9 01 


ATGAATAACT 


TAGAAGGCAG 


D / ~> J- 


TGTCATGTGC 


ATTCTCAAAG 


68 001 


GAACCCTGGA 


GGTGAAGGTT 


6 8051 


CAGCCTGGGT 


GACAGAGAGA 


68101 


AAATAAAAAA 


TAAAAAATAG 


68151 


AAG C AG TATT 


CCCAATAGCT 


68201 


GTCTGTAAGT 


AGGTAAAAAT 


68251 


GTGAAATTAT 


CTCTAGGCAT 


683 01 


CTATGATATT 


GGCGTAGTTT 


683 51 


TTTTCTCTTC 


ACCAGCACTT 


68401 


CCTGATGCCC 


TGAGGCATCC 


68451 


ACCATCTAGA 


ATGGTACTAG 


68501 


ATTGCTACTC 


AAATCATTGT 


68551 


GAGAAAGGCT 


CAACTTCTAG 


68601 


ACATTTTTAC 


AGAACTACTT 


68651 


TATATTCACA 


GCTAAATATT 


68701 


TTAGTTCCCT 


GTAGCTGTCA 


68751 


ATGAGGGGTT 


TTTCTTGTTA 


68801 


CTTGTATGAG 


TGATGAAGCT 


68851 


CCTAGTAGTT 


TGCTCATTTG 


68901 


TACAAAACAG 


GAATTGAAAC 


68951 


TTTGAAAATA 


GATAATGCTG 


69001 


ACATTAACTC 


TCCCTAATTT 


69051 


TTTAAAAAAT 


ATATTGCTGG 


69101 


GCATAGAATA 


ATGTGAATCT 


69151 


TTTGGTCAAT 


GTAATTTATT 
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TATATATTAT ATAATATATA TATAAATTAC 
TATATATATA ATATATAATT AAAACATATA 
CACTACTGAA AAAAGTAAAG CTGAGAATGA 
GGAAGCTAAA AC AC AAAG T A ACAAAGGCCA 
TTACATACTT TAAAAATGGA TAAACTAAAT 
TTGGGCTTCA CAGTGTGATT TTCAGCAATC 
AACTACATCA AAATTGTCAT TTGATTTATA 
ATAGAGAGGG ATTCACTCCT AACACTTTTC 
ACACGGAATC TCTTGCCAAA TTTGTGTGGC 
TTATAGCCTG GTAAGAAAGA AAAGACATGT 
AAAATTATCA TGCTATTAGA CTCAGTACAA 
GAAACATCTG CAGAGGCAGG AGAATTGCTT 
G C AC TG AG C T GAGATCATGC CACTGCACTC 
GACTGCATCT CAAAAAAATA AAAATTACAA 
TGATCAATCT GGCAGCATTT TCTGAAAGTT 
GCTAAAAGAA GACATGTTAT ATAATACTAA 
TAAGAGAATT GTTAATGTGC TTGCTGGGGA 
T AC C C TAT AC CTAACCTAGG ACTCAGTAGA 
GACCAAGAAT TTTATCCTGA TTTCAGATCG 
CTTCACCAGG ATTATATGAA AAAAATTAAA 
ATTATATGTG CTGAAATAAC TTCTTTTCTC 
CTATGTACCA CTCTTGTCAG AATCAAGGAA 
GCAGCTTAAT TTTCTCACAG AAGGCCAGTT 
GAATCCAGCA AAC TAT AT TT TTTATAAGTA 
CTAAATCCTT GTGTTCAAAT TTACTAAAGC 
TCAGAATTTA AAATTTAAAA GACTTTCAAA 
TGCCAAGGCA ATTAGAACAT ATGTTAAGGT 
GAAGGTCAGA GCAGGGCAGA GAAGTAGCCC 
CAGATATTGA CTCCTATGCT AACCATAAAG 
TTACCTCTCT GAAACATTTT TTTGGGTGAC 
CTTCAAAATA AGGGAATTTG AAACCAAATC 
CAACTAAAAA TTTAGTTGAA TAAGATTTTT 
ACGTTATGAT ATTTGCCATC TAGAAGTGTT 
AGTCAGATGA TGCATCCATT AATCTTTGGG 
AAAATTTTCA AATTATTTAC ACTACTGGTA 
TG AAAC TAG A TGCAATAGGG ATGGCCAGGT 

Fiq. 2 (cont'd 38) 
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69201 TATTTCAGTA GAACAACTAG CAAGACTTCA GATGCATGGT GGAGTGGGGA 
69251 AAGGAGGACC TGTTTAAGGA AAC TAG AG C T GGGAAGTGTG AGATTAACTT 
693 01 AGTGCCAATG TGAGGACCTA AAAAGCAGAT GTGGTGGAAA ATTTAAACAG 
693 51 GCTTGCCTAG AAGGTCAAGT TAGTTGATGA CACTTGATGA GATTGTCCCA 
69401 AGCTTTGGGA TTCTCAACAA AGTCTTTGTT AGTGAGAAAT TTGGAAAGAG 
69451 ATCAGGTATA GTTAAGAAAC TGGGTTGGAA AGGCCACCAG GAAAGGCGAA 
69501 TATTCTGACA CAAAATTTGA TCATTTTATT TGGAAGCATT TCAAGCCTGA 
69 551 CCTGAACGAA TTGTTTAGCC TC AG AT AC AT GC ATAAAACT GTGAAAAGAG 
69 601 ACATTGACTC AATTTAGCTT CTTTAACATG AG AAAC TTTC GTGGAAAACT 
69651 AG AAC TTT AC AAGCTCAGCT GGTGTTGGGG GCATCATTAT CTTGAATAGC 
69701 TCACTGGAGG AAAATGAAAT CTTAGTTTGG TTCTCAGGTT TTAAAATATC 
697 51 TATCATTTTT GAAAAGTGTG AAGTAACAAA ATATGATCTG ATTATCTTAT 
69 801 TCCTAAAATC CTTTGCAGAA TTATCCCAGC CTCAATCTTC TCTTTAGTAT 
69 851 TTAATGAGAA TAAGAAACTG GAAATGACTG AATTGGAAGA G TAG AC TTT A 
699 01 AATCCATATC T TG ATGG CAT ATACATTTTT C AG TTTTTTT TCTAAATGAT 
699 51 TAATGAGGAT TCTCAAAACT TGAGTATCTT CTATGTTTCC CTTCAACATA 
7 0001 AAGAAATTGT ATGAAAATAT TTTAAAAATT TCTAATGATT TTATAGTTAG 
7 0051 CTATCTTGGG AATTCATTTC TAATCATGTA CCTCATCCAA ACTCCCCACT 
70101 ATGGACAAAA ATAAAATAAA AATT ATT AG T TGCATCTGAA GGCCACATTA 
70151 CAATTTCTAT G C ATT AT AG A AAC C TG AG AA AATGTATCTT AAAAAATAAA 
7 0201 TGTGAACAAC TAACCATAAT TATGAAGAAG AAAAATGAAA ACTAGAAATA 
7 0251 AACTATTGAA AAATG TC TAT GTATC AG TT A AGTTTTTATT TTAAAATTCT 
7 0301 TT ATG TTT AT CTCTATAATA CTATTGGGAA AGAGAGAAAG GAAAACCTGA 
7 0351 CTTTGTTCTC ATCCAAAGGA GGTGATTCCA CTGATTTAGC CAAAATAAGA 
7 0401 CTTCCTGGTT ATAATAAATA ATAAAGTTTT TGATGTTTTT TATATGGTAC 
7 0451 CCCACTCACT AGGTGATCAG ACACCCTCCT GCAAAAAAAA AAAAAATACG 
7 0501 TATGCAATAA AGTTAAAGTT TTATGTTATT CTTTCAAGGG GAG AAAC ATC 
7 0551 TGTTTAACAC AGACCAGAAT ATTTCAACAA AGTCATCCCA ATATTTATGG 
7 06 01 AGATCATAAA TCAAGCGAAA AAATATATTC ATCAACAACT AAACAAACTA 
7 0651 CATTAAATAG TCTCAAAGCA CATTTTCACT TTTTTTCTGA CAGGAAAACA 
7 0701 GGTTTCACAA GTGTGGAGAC ATTTTACCAT GGCTTTTAAC AGTGAGGAAG 
7 0751 GATGTTTAAA T AAAGG G AAA AATTATATGG AAAGCTCAGA GAAAAGAGAT 
708 01 GGGTGTGGCT TGAGTGACAA GG TG AG AG C A GATCTCATTA ACTGAAATGA 
70851 GAGAGAAGGA AGGAATTTTG CAAATATGGA AAGATAACTA GTGCAAGTTT 
7 0 901 GAACAGATTA TGTCAATCAA TG TAG AATTT GGCTATCTTT TTAATCAAAG 

Fig. 2 (cont'd 39) 
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70951 


AAGACTATGG 


AATATTTTAT 


AGGTGTTTGC 


TTATAC TC AA 


AGTTTTAAAG 


71001 


AAAT AAC AG T 


ATGAATTTGG 


TTG AAC I AAT. 


TTTTTTCATA 


bA TAGG ATTC 


71051 


TCCCAAGTTA 


TATAGCATAT 


ATATTTCTTA 


AC TAG TTATT 


CTTCCTTTTA 


71101 


CATATATTGT 


GCCACATTGA 


G T AAC AAC TA 


AC CTG C T AAT 


AGC T ATTGGT 


71151 


TTTTAAAAGA 


TAATTAATAT 


TAGAAAGTGA 


TCATTTTTCT 


GTTTCATATT 


71201 


AAACATGATA 


TTC TGAAAAA 


GCAACATTGC 


CTGAATGTTC 


TACATTTTAT 


71251 


C TTTTTG AAA 


ACAGGTTTTA 


TAAGAGATTT 


CTTGTGAAAA 


GCTGAACGTT 


71301 


C TG AC AC TG A 


AATAAGTCAG 


CTAACTCAAA 


GCTAAGCTTA 


ATTTTTTGAC 


71351 


ACTGTTGGCA 


TGAGGTCTCA 


TTCCCAATTT 


TTTCATTTAA 


AGCCACAGGC 


71401 


AAATGTTTTA 


ACAGATTTTA 


ATCCGTAGTA 


CAAGCATTAT 


TGATCTTAAA 


71451 


TTTAAGGATA 


AAAACCTGAT 


TTTAATTAGA 


ATTTAATATG 


CATTCTAGTA 


71501 


TTTACGTTGT 


ATAATTAATA 


TTTACATTCC 


ATGATTCCAC 


T ATG TAC C AT 


71551 


TTATTTCTTT 


TTGAATAAAT 


TTCCAGTAGG 


AGCAGAATAA 


ATTTTCAGTG 


71601 


AATATTTTAT 


TTCTTGGGGG 


ATATTTTTAA 


ATGGAAAATA 


TATTAAGTTT 


71651 


CGGTAAAATC 


TGTTGCTAAT 


TTGGCAGTGG 


ACAGAATATA 


AAAATTG GAG 


71701 


AGACTGAGTC 


ATTATGATGA 


ATTGGGTCTG 


ACTTTTGTCA 


TG AC AC TGG A 


71751 


AATTTCCCAC 


AAA TATTATA 


TTCTTCTTTT 


ATAATAAATA 


TAG TCG AAAT 


71801 


GAATTGCAGT 


CAAGTATTTG 


AAGACCCATC 


TATAAATTTA 


GGCGGTTACT 


71851 


GTTGATTTTT 


CATTATGAGA 


GATTCTTCCA 


CTCATAAGCT 


A C TAAAAGTA 


71901 


CATAAAGAAG 


GTCTGGTTGT 


TTGTTTTAAA 


TGTGACTGTT 


CTCTATCAGG 


71951 


AAAATGTCAG 


GTATCCGATG 


AAAATAGATA 


TATGAGGTGC 


CAGGTATCTA 


72001 


TTCCAAACTT 


GGATATCACT 


TCAATTAGCA 


TCATCTTTTT 


TTTTTTTTAA 


72051 


AGTGTCTAAG 


G TT AG AAT AG 


TCACCAGATA 


TTCCCATGTA 


TG AAG C AATT 


72101 


TTCTGCAAAG 


GCCGCTGTGG 


ATGATCTTTT 


TAAAATATAT 


ATTCTGGGAG 


72151 


ACATTGAGTA 


AAGAGAAATT 


ATTTACCAGA 


GAATGAAGAA 


CCGAGGCCCG 


72201 


ATTCTTTGGC 


TTTCTGCCAA 


AG ATG C TG AA 


GGC AG TG ATG 


AATGACAAAT 


72251 - 


ACATTACCAA 


GGAATTCTCC 


CTCTAAGAGG 


C TG ACAAAG A 


TCTGATTTTT 


72301 


AGGATTATAT 


TACCACCAAG 


AAGATACCCC 


TTGTCACTGA 


GCTTCTAATG 


72351 


GAAATATGGT 


CTATACTGAA 


ACAATTCTCA 


GTTCTTTTTC 


TTTCTATCTT 


72401 


TTTTTGAGTT 


ATTTTATCTT 


CCAAAAATGA 


GTTATTTCTG 


ATAAAATAAT 


72451 


TCACTTAAAT 


AATTATGAAA 


GTTCAAATTT 


GTGCAAATAT 


TTTTATTGGG 


72501 


ACATCTTAAA 


ATTACTCTAA 


ATTCAAAAAG 


AAAATATATG 


C TTT ATT AAA 


72551 


ATTTGATCTG 


TAAGCTGCTT 


TGTTTG T AAT 


TT AAC T ATT A 


TATAAAAATT 


72601 


G T AT AAT AC A 


TATATTTTAT 


TTACTTTATT 


CCTGTGTTGC 


TTTGGCTTGG 


72651 


TGAGACTAGG 


TCTCCACATT 


AG GAG TTTT A 


CTGAATGAAA 


AAGTATCAGA 


72701 


ATGTAACATG 


ACTTTGATAT 


GGCATCAGAA 


TTTAATAAGA 


TGACATTTAA 
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72751 


TAGGAATTAG 


GGGTAAGTTC 


72801 


TTTTGCAGGC 


ACAAAATACT 


72851 


AAAACTTCAG 


GTTTGCAGTT 


72901 


AATAG TG AT A 


TCTAAATATC 


72951 


TGTTATCTGA 


AATGAGTGGT 


73001 


GTTCTGGGTG 


CCACCTTTGA 


73051 


TAAGACAATA 


AGAAAAACAA 


73101 


TACTTTAGCT 


GAGAAAGAAG 


73151 


AGCATCCAAA 


TCTTCCAGGA 


73201 


AATTCAACTA 


TGAGCAGGGA 


73251 


TCCAAACTCT 


GTTTATTCCA 


73301 


ACCATTGAAA 


TGCTGAAGGC 


73351 


GGGAGCACTG 


AATGGTCCAG 


73401 


ATTTCATTTC 


CACCCAATAT 


73451 


GTAGAAC^GC 


ATTTACTCAG 


73501 


TGTATTAATA 


GTTG TATTAA 


73551 


TTGGTATTAA 


TGTGATCAGA 


73601 


TATCAGATAA 


TTTTTAAACT 


73651 


TACACATCTC 


TCTACTTGAT 


73701 


AATACATGCA 


TATCTGACTT 


73751 


T AATC TAG AT 


CTGTATTTAA 


73801 


AGCAGTGGGC 


ATTGGGAACA 


73851 


ACATTTCCTG 


TAGATAATTT 


73901 


GCTCTGTTTT 


AAATTATTCC 


73951 


GGTATTTCTG 


CCACTGCCCC 


74001 


CATTATTAAT 


ATCGTGTCAT 


74051 


ATTTCCTGAC 


AC TTTTC AAT 


74101 


TTAAAAAATC 


CTGTCCTAAA 


74151 


TTCCAGAAGA 


ATTCTGAGAA 


74201 


CCATCTTTAG 


ACCACTGGAG 


74251 


ACAATGCTCT 


CCAGGATCAG 


74301 


AACGTTTATA 


ATTCTTATTG 


74351 


GCTGATATTC 


ATTTCTTCCT 


74401 


TG AC ATC AT A 


CATTCTTGGT 


74451 


AATATTTCTT 


TTGTAAATTC 
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CAGGTTTTAC AC TT AAAT AC AAATAATCAA 
TCAAACAAAA TCTGAAATCA TTCATTTGAC 
GACAATAAAT AC AAT AC AAT GCAACAGTGC 
TAATGTAATC ATAGGTAATA TTAGTAAG TG 
GTGATATCCT GCTTTACTTT GTACTGGTGA 
AAGGAATAAA GACTATTCAT ATCTCTTTTA 
ACAAACAAAC AAACAAAAAA CCACCTCCTT 
TTATTAGGTA CAGCTTGACA AGTTCAGCTA 
GGTTGTTACT ACATAAAATC AAACCTTTTT 
GATTTTATTT TTCTTTCGGG TACTAAAGCT 
C AGG AATC TG AACTTATAGA ACTAAGAGAA 
TCAGAATTCT GCTGCCCAGG CGGCTATTCA 
ACCATCCTCC CAAAGGTATA TTTAGAAATC 
AATAG G C ATC TATTTTATTT ATTAATTACA 
TGTCACTGTG CATTATTAAT ACATACTAGT 
TACATACTAG TAG T ATT AAT ACATACTACG 
ATCCTAGAAT TTTAGAACAG TGACTTCCAT 
GATCTTAAGA AATTTGGTTC TATAGTTGTA 
TCAGTGGAGA TGGAGATGGA GTGGTTGGTT 
CAGGCAAAAC AAACCCATTA ATGAGTATGA 
AAATGAAATA GTCAATATGA TGATATAGTA 
AC TTTTC CTG GATGGAGGCT ATAAAAAGGT 
TG AAA C AAT A AAAACAACGG GTGAAAGGTA 
TATGCTTAAG CAATTCTAAA CAATGAAAGG 
TACCCCTGGG TTCACCACTG AAGAAATGCT 
TTTTTTCCTT TACATTGGTT CTATTTACTC 
GGCCTTCAGT GAGCTCAGCT CTTTCCCAGC 
ACATGAATGC CTTATTATCT CTCTTTTCAT 
AAATTTTATG AAGTCTTTCA ATGTCTTCAG 
TGTAGCTCCT TTTCCCTCCA CTCCACCAAA 
CAGAAACTTA CATGACACTA AATTCAGTAA 
T ATT AG AC AG ACATGGAAAC AGCATTTGAT 
ATGTGAAACA TCCGGTTTTT CTAATGTTCG 
TTTTCTTCTG TTCCTTTGAA ATATTTTTTC 
ACTCTTTTGT ATCCATTTGT TAATTGTTGA 

Fig. 2 (cont'd 41) 



7 4501 TATCCTAAGC TCTCTTCCAT 

7 45 51 TAGAAAATCA TCTCATACTC 

74601 CTAATAACTG TCAAATCTAG 

74651 ACCAAGAGAG AACTCCACGT 

747 01 AAACCTAACT TCTCCAAATT 

74751 TCCTTCAGTG CTCTTTGCCT 

74801 GCCAGAAATC TAAGTATTCT 

7 4851 TTACCAAGAT CCGTTGATCC 

7 4901 CTTTTCTCTG ATTTTACTCT 

74951 ACCACAAACC CCTAAATAGC 

75001 ACCAGTCTTC ACGCTGAAGC 

75051 GTATCCCATT CATTGCCTTT 

75101 TAATCTCTCT TCACCAGCTT 

7 5151 AATTAAGTGC CAGATGATGC 

752 01 TGCATATTCC ATATTTCAGC 
75251 TTCTTTATGA ATTGTGTTCA 

753 01 CAGCATAAAG ACTTTTCGAG 
753 51 TTCCTTGACC CCTACATTGA 
7 5401 CAGCAATACT TCCCTTCTTA 
75451 AAGGTATGTC TTTCCTGATT 
75501 TTGAAAGACA TCAAGATGCC 
75551 TATTTGAATT TTTAAAAGAA 
75601 TTAAATTTTA GGAAAACAAT 
75651 G AC AAAG TAG TTCTGTAGGG 
7 57 01 CATGTGGGGG AGTGATTTAT 
75751 AGCTTATCTT AGAGCTATAA 
75801 - CGTAAATAGG TCCAGAAGGG 
75851 CAGCACAAGT CACTGAAGTG 
75901 GGCAACAGGT AGCTGAATAT 
759 51 TTGCTCAGAA ATGTGGGAGG 
7 6001 ATATCTAAAT TGATCAAGAA 
7 6051 ACTAACAAAA ACCATGGTTA 
7 6101 ATTGTATTTC TGTAATATTA 
7 6151 GTGATTCTTA TCACATTGAC 
7 62 01 TGGACTAATT ATGTGTCTTT 
7 6251 ATAAATTTAT ACTGGAAATC 
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TATGATTCTA TGCATCCTAT TTAAAATATA 
TAGCTGTAAT TTTTATTAAT GTGCTAATAG 
GTCTCCAGGC CAGGCTCTGT ATATCCAGCT 
GGATATCTTT GGATGTCTGT TTTGCATCTT 
TGCACTTGTC TTCTGTCTCA GACCTGCTGC 
CAGTAGATAG CACCACCATC CTTCCATTTA 
TCATAACTCC TCCTCTCCTC ATTGAATAAA 
CATTCCTTAA ATATCTCTTG GATCTGTTAA 
TGCCATCCAT CACCTCTCTC CTGAACCATG 
CTTCCTCTTC TTAATCTTAT CCTGCTTTAC 
CAGAATAGTC ATTAAGAAAC ACATCTACAG 
AGAATGGAAT ACAGACTCCT CAGCATGACA 
CATTTATTCA ACAAATATTT ATTCATAACC 
ACATATAGAC TTCTTGTTCT GTTGTTGCAT 
TATCCTGAAT TGTTTTCAAT TATTCATAAG 
TTCCATTTGG AATATTCTAC GTTGTTTGAT 
ACACTGCAGC AGCAGTGAAC CTAAATATGT 
ATGACACCCC CTGTGATATG TTTCTGGAAG 
AAATTACATT ATACTTTGGG GCTTTTATTT 
TACAATAGTA GAGCTTGTTT TTTCACCCTT 
CATGATGATG TCTTGCATGT AACAGGGGTT 
GAATAAAGTA ATTTTTAAAT GAATTTCAAT 
TATATAAAGT G AG ATATGC T TAAATTGAAG 
GCTACTTCTT TCAAGACTTT AGCAACTTTC 
GTGATGCATG GAAAATTACT GCATATTTAA 
TAAAGC AG C T TATGTTCTAA ATCTTCATGT 
ATTTAAAAAG CCTTAATCCT TACTTTAACA 
AAACTTGCTG AAAGGATTCC TTTTATGTTA 
ATCTACAGAA ATTGAAAAAT TGGAATTCTT 
GGTGGAGCTT AAGGTAAAAA ATAACAGTTA 
ATATGAAAAA ATAATTTGCT AGGTTTTAAA 
TAAAGGTTTG AATATATATA GGATAGTTAG 
AAACTCAGCA TTAAATTTAA TGAACACAAA 
CATTGACATT ACATGGAAAA AATAGTCAGT 
CCATGGGTTA TTAAGGTAAT TGTATGGCAT 
ACATTGAAAT TCACTTTTAG AGGCCCTTAA 

Fig. 2 (cont'd 42) 
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76301 


AATATTTCTG 


TAATATATAT 


TTTTAACATA 


TGATCTTAAA 


AGATATATTT 


76351 


GGAATGACAC 


AACAGTTTTA 


TAGACAGGCC 


TGACTATCAC 


ACAACCACAC 


76401 


ACCAATTTGT 


GAATGTGTTT 


CTATTTCCTC 


TAAATTAATG 


CATCACATTC 


76451 


ATTAACAAAG 


TTTGATAAAT 


G AC T AT AG TC 


TATAATAAAA 


TATTTTTGTT 


76501 


TACAAACATA 


TTTAAACACC 


TGC TATTAAG 


TATAGGCATT 


ATCAGATCTT 


7 6551 


A AA ATAPAAA 


GATTTAAAAA 


ATTACCCTGT 


GGTCATGGAG 


CTCACAATCC 


76601 


A PTfiP AA AAA 


TAATGTTTGT 


GATAAGAAAT 


TTGAAAGTTG 


AAGG TAATAG 


7 6651 




CTTTATTTTT 

v*- XXX ..Pi. A x X X X 


C AAAATG T AC 


CATTGCTTTC 


TAAGTCACTA 


/ D / VJ X 


pttptptgta 


AATATGGAAT 


TGTTTTTCCT 

X V7 X X X X X N*- X 


TAAGATATAC 


CAAATATAGT 


/ O / Jl 


ffJfiATAAPfir 
J. oOnlnALUU 


ATGTATTAAA 

i \ X VJ X I^l X 


ATTCTGTC AG 


C ACTAAG TTG 


TTTTTTARA P 

X X X X X X ^i\JJ.C^\_ 


/ DOUX 


ATAfiTfiATA^ 

/-v 1 nu J. \Jfr\ ± -r\VJJ 


GPAAAPATAG 


TTATATTGAA 

X X * \ X Sir X X VJiul 


TGAAAAATTA 


fiAATPAAATT 


/Dfljl 


rp 7\ rprp A A A r* A 


TfITP;TAPTGA 


TTOATAPPAP 


ATPPPATATC 


PTTfiTATAClP 

\— X X vj X -/i. X AO V^. 




A A ""PA f" 1 A. A 

AA 1 A*., AAIJV3 1 


ttpcz a a ttt a 


T A A TP, P»T A A A 


P A A A A TAP AT 


jr\v_ v_3\-3 1 Hj 


/or Jl 


rp/-»rppi/-i A rpA A 


X X X X .rivj X v.. 


TAGTGGGAGA 


fiPAfiAAQHTA 


AAQPAATCTA 


/ / Uul 


Xb 1 LaA X V_.jH 1 X 




A AP ATGTATA 


PPPAAAPAPT 


TATA AP3TTPP 


77051 


A APATf^P^APA 


ATAATGGGTG 


CCATAGGGAA 


GG AGGG TAC C 


A AGGA APPTA 


77101 


V — J. VJVJ.tt.vJVJ J- X Jri. 


CATAGGGAAG 


ATTATTCCAA 


GG TAG T AAT A 


TTTAAGTGAA 


77151 


TATCCAAGGA 


AT AATTG TC A 


ATCACTTTAT 


AAGTACTGAG 


GG AGG AG TAT 


77201 


TTCAAAAGAG 


CTTTGAGGCG 


GAAAATAAAT 


TAGTTCCTTT 


ATGGAACTAA 


77251 


TG TAAGG AAA 


ATA C TAAGC A 


AACATGTAAT 


AAGAAGAACA 


C GG TTG ATG A 


77301 


G TT AAG AACT 


GACAAGATTA 


CTGAAGGATT 


GTAGGCC ATA 


TTTAGAAGTT 


77351 


GGATTTTTTA 


TCTATTCTTA 


TTAAAGTGAG 


AAGTTATTGA 


AAGGTCTTAA 


77401 


GTGGGGGAGT 


GATGATGAAG 


TTTGCCTTTT 


AAAAAAGATT 


TTTCTAGCTA 


77451 


TTGTTTATAG 


AATGGTTTGA 


AGATGAATAA 


GTCCAATAGC 


TATACTTGCT 


77501 


G T AAAGGTTA 


TGTTGGTAGC 


TTGAACTGGG 


GCAGTGGTGA 


CACAGAGGAT 


77551 


GGGAGATGGA 


AAATGACGAG 


TGAACAAACA 


C AT AC C TG AA 


AATTTAAGTT 


77 601 


TA A A AATAGA 


CCTCTCCATT 


AATTCAGATT 


GCTGATATTC 


ATTCGGTTAG 

-» J- X X ^ — vjvj X X -LXXJ 


/ / DjI 


LLA1 it- J. 1 1 A 


C 1 VJ AAL. X 1 1 A 




X A X AC 1 bAA 1 


rp A A A T 1 A r"*T""P A 

1AAA1AL1 X A 


77701 


CAAGCACTAA 


AAAAGAAATT 


GTTAGGGAAC 


AG T AAAATG C 


ATTTCCTTCA 


77751 


TTTCACAATA 


TTATTAATAT 


TATGGCTTTG 


CTAATCTTTA 


TTGGTGAATG 


77801 


CAGTCATAAT 


TG AAGGTAAC 


TGATACTTCC 


AAGGACTACT 


TTTGACCTAG 


77851 


GATTACTATC 


TTTTTAAAAA 


TTTAGTATTA 


AAGAAGTCAA 


ACACAATTTA 


77901 


TTAATTCTGG 


ATATAATAAA 


AATTCTGAAA 


TAC TTTAATA 


CTTTGTGCTT 


77951 


TTCTATTTGT 


GAAAGTTAAT 


TATTAGGAAC 


GAG C T AG C AA 


ATGCTACTTC 


78001 


TTTTTCAAAA 


AGCTAATGGC 


CAATCACAGC 


AAAAATTTAA 


AG C ACTAAG A 








Fig. 2 


(cont ■ d 


43) 
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78051 


AATACCTACA 


CATATTCTTC 


TATTGCCCAT 


TTATATGACT 


TCCATAATAG 


78101 


TTGATTAAAG 


G AT AC CGG AT 


TCCTTTATTG 


TTGAATTAAA 


ACCTCCTACA 


78151 


TGAAAACCTT 


GATTTAGGTT 


TAGAAGTTGG 


TAATGTTTTG 


GCATGCAAAA 


78201 


CCAGTTAATG 


TTCTCATCAT 


TACTTTTTAA 


AACAATGTTA 


AGAGATGAAT 


78251 


TCTAGGGATT 


ATAAAAAAAA 


AAAAGCTGTA 


TGTGTTTCTT 


CCTATAAAAT 


7 83 01 


TTTTCAGCAT 


GATTGCCTCA 


G T AG AAAAAT 


TAAGGGACTT 


ATTGATATAT 


78351 


ATGTATATGA 


AGGTGAGGAT 


AC A CAT AT AC 


ACACACACAT 


AT AT ATG TAG 


78401 


G TAAATAC AT 


ATATTACATG 


TCTATCAATC 


CATACATACT 


CATTTATTAT 


78451 


ACGTTTTGAA 


AGCAACCAGT 


T AT AG TTTTG 


TTGCCATGGA 


TCATTTTTAC 


78501 


TATTCAGTAA 


ATCAGTCAAT 


TGAAGAGGCT 


TGATTTTATG 


GTATTAGTTT 


78551 


TTTGGAAACT 


GTCAGCTTTA 


TAGTAAATTT 


TGACATCTTA 


CAACTTCCAC 


78601 


TGAGATTTTT 


TTGCTTGACT 


AATCTGCCTT 


GATGCCAATA 


AG TAT ATT AA 


78651 


CGGAAATGGA 


CTAAAAGCAA 


ATGTGACTTG 


AAGCACAATT 


TTGTAAATTT 


78701 


TCTTAGTGTC 


TCAGTAATAC 


TTAAT AC TAG 


TGCATTTTAG 


G TAG G AAAAT 


78751 


TTTCAGTTTG 


TTTTATTTTA 


AATAA C TATA 


AATCTTATAG 


TTGCTTGTAT 


78801 


AAAAGAAACA 


GATACCTTTA 


ACATGATTAA 


ATATCAAATG 


CTATTCTCTT 


78851 


CAAAATATCT 


TAACTAAAGA 


AGCACTGCCT 


GCTCTTAGAA 


GTTAAGCAAG 


78901 


GCCATACCAT 


ATGCTGCGTA 


CATGGCTTTT 


AACACAATGG 


ATATTAGAAA 


78951 


CAGCCTAAGG 


CTGAGCCTGG 


CTCCACTATT 


TTTCAGCTAT 


GTGACCATGT 


79001 


GAAAGTTACA 


TTTAGTAATT 


AAACTCATTT 


C AG TAG T TTG 


CTTTAAGAAT 


79051 


AAAATTAGGT 


ACTCCGGGGG 


CATATCAAGC 


ATATTGTAAA 


ACCTAGTTTG 


79101 


ATTATTATTT 


GTTATTGGTA 


TT AC T ATT AC 


TATTCTATAA 


TAAGTCATGG 


79151 


GCAGGCAGTA 


GGGGTACATT 


GGAAGAATTG 


CACTGTCTTA 


AATATGTCCT 


79201 


CTGTTTAACT 


CACAAACTCA 


GTCTACCTAG 


GCTTTCTTTG 


GAGGATCTGC 


79251 


CTTTCATTGG 


CTGTTTGACT 


TTGGCCAAGT 


TACTTAACTT 


CTTTTCACTT 


79301 


CAGTTTCCTC 


ATCTGTGAGA 


TTATGTGCTT 


ACATGACTTC 


AGGTTTTGTT 


79351 - 


TTGGCTCTAA 


TATGGTATGA 


T TC T A TG AAA 


TGGAAAGTTA 


ATACATTTGG 


79401 


CTCTAGTAAC 


TGTATTTGAA 


GC AC AAA TAT 


TAAAAAGCAC 


AATTAATTCT 


79451 


CATTCTGAGT 


TTCCATTTAC 


TCTTTTAAAT 


TAATCATTCA 


GAATAAATCA 


79501 


TTTTGGAAGA 


GCTGCTTGAT 


CCAGGTATTC 


AGTAGAAATC 


ACTAGCATAG 


79551 


CATTTAATTT 


TAGACAAAAC 


TGAGAACTCA 


TTAAACTGCC 


AG GG C T ATG G 


79601 


ACTTATATGA 


GATTCTCATT 


AAATC TTAAT 


GTAGATAACT 


CAGTTAATTA 


79651 


AAACAAATAT 


GGTTGTACTT 


TATTAAACTT 


CTAAAGTCAA 


AACTGCATTG 


79701 


AAATTATCTG 


TACAAAGCCT 


TGTTGACCTT 


TATTAGAGAA 


CTGCCTCTCA 


79751 


AAAGACCTAA 


AAGACTTATT 


TGTTCAGATC 


GAGACTCTTC 


ATGAGCCAAT 


79801 


GTGATACTCT 


CCCTCTATTG 


CTAGATCTTC 


GCATCAGAAG~ 


ACAGCATTCC 








Fig. 2 


(cont r d 


44) 
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79851 


TCTGAAAGTG 


TTTCTAGTAT 


CAACAGTGCC 


AC AAG C C ATT 


CCAGTATTGG 


79901 


CAGTGGTAAT 


GATGCCGACT 


CCAAGAAGAA 


GAAAAAGAAA 


AACTGGGTAA 


79951 


GTTACCATCC 


TTCATCTAAT 


TCAGAAGCTT 


ATT AATG CAT 


AATG TG TT AG 


80001 


GCCTTTTTCT 


TTGGGGCTTT 


AGTGATCTGC 


AGTAGTTTAC 


AAAGGGTCCC 


80051 


ATTCAAGCTA 


CTGAGACCTC 


AAATGCTGCA 


CTCATCACCA 


AAATTGG AG T 


80101 


GGCATGTACT 


GAAAAGCATA 


CATTTTAATG 


TTGGGACTAA 


ACTTGGGTTT 


80151 


GAATCACCAC 


TATATCTAGA 


CCTTTTGAGG 


GGCCTGAATT 


TTCTAACCAA 


80201 


TAAAAAGACA 


GTTAATAGCA 


ACTATATTTA 


TTTGTGAATA 


TCATTTATTC 


80251 


ACAGATGTTA 


TCTAATTTTT 


C TAT AG TATA 


AC TAT AC AAA 


CTATG TAG TA 


80301 


TAACTATAGA 


GTTATACTAA 


AGAAAAATAA 


GATAACATCT 


GTGAATAAAT 


80351 


GGCTTAAAAT 


AGGGG TTT AT 


TGTGGGCATA 


GAGATGAAGG 


AAAAGTGAAA 


80401 


AAATGATGAT 


GATGGTGATG 


ATGATGGTGA 


TAGTGGTCTT 


GGAGGAAAAG 


80451 


GAGAATGGGA 


GTTAATAAAG 


GGAAAGAATA 


AACAATGAAA 


CTCTCATTCC 


80501 


ACCTTTGGAA 


TCGACAGGGC 


TTACCGTGTG 


AATAGTTTCA 


CCCTAAAAGA 


80551 


AATC AAC 0 AC 


ATTAGTGTCT 


GCTTGATGTT 


TTTAACCAAG 


AG AAT AT AG C 


80601 


AGAAATATAG 


AAATGCACTT 


TAACAGAACT 


GTACCTTAAG 


TTTGCTAGTG 


80651 


ATATAATTTA 


TGATATTGAT 


CAATAGCTAA 


ATAGCCCAGG 


GG AAG AT AC T 


80701 


GTTACTGCGA 


AAAATTTAAA 


AACAATGGAG 


TCAATGATTT 


CTTTTAATAC 


80751 


CAAAAAAAAA 


ATGTAGATTT 


TGAGTAAATA 


CAACTCTTGA 


TG AAATC C AG 


80801 


ACATAATTAT 


CAGAGGATTT 


TACTGGAGTG 


CTTTCTACAA 


ATAATGAAAG 


80851 


AAATATCTTT 


TTATCTTAAA 


AAATGTTTAT 


ACAGGTAATA 


TTTTAAAATA 


80901 


CTGATCAGCC 


TTCATTCCCT 


TGATTTGTAA 


TTCCACACTC 


TTTCATGTTT 


80951 


CTGCAAGGTG 


AACTCTAGAG 


GAAGTGAGGT 


GAAXATAAAC 


CGTGGACAAT 


81001 


TTGGCATGGA 


TXTATAAAAA 


AACCCTACCT 


TGGCATGAAT 


GCTATCCATT 


81051 


TTGGCAGTAG 


GCTTTTATAC 


CTTTTAAAAC 


AG ATT AC C TT 


blAlolLl 1 1 


81101 


TCTTTGTGTC 


TTTTCATTTT 


AATCTCAAAT 


TTTAAAGAGA 


TGTAAAACCA 


81151 


CTTTCTGAAT 


AGAGCTGTAG 


GGGATACCAA 


TTCTGGTTTT 


GAGTAGTCTG 


81201 


GGGTTGGAAA 


ATTTGAATAG 


AAAAATCACA 


ATTAATGAAG 


TGTTAGGTGA 


81251 


ATTTGATTTC 


ATTTTGCTTT 


TTAAGTTTGT 


ACTGTCAGCA 


GGACATGACT 


81301 


TGATTGTAGC 


GCTAAAGTGG 


CCATTTAAAA 


CAAATTGCCT 


TGAAGAGAGA 


81351 


AGCATTGGGA 


ATGGAGATC 














Fig. 2 


( cont 1 d 


45) 
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Human genomic sequence 



1 


GAATTCCTGG 


TGGAGAACAG 


CACATGTACA 


GATGGGGTGA 


G AAC AG C ATA 


51 


CGTACAGGTA 


GGGGTAAGCT 


GGTGCTATAT 


GAGAAAGCAT 


GGAATAAGTT 


101 


ATTAAGTTTG 


ACCTGCTTGG 


GAACTGAGGG 


GCAGGTGTGA 


GGGATGAAGC 


151 


AGGAGTAGGT 


AGGGGC TAG A 


TCACAAAAGA 


TCTATGCCAG 


TGTTTCTCAC 


201 


AGTGTGATTC 


CCAGCCCAGT 


AGCATGATAT 


C AC TTGGG AT 


CTTGTTAGAA 


251 


ATACAAATTC 


TTATACATCA 


CCCTGGACTA 


GACCACCTGA 


A TAAG AAAAG 


301 


TTGGGCATGA 


GGCCTACAAA 


TTTTTAAAAA 


AG TC AT AC AG 


GTGATTGCAA 


351 


TGCATGCTAA 


AGTTTGAGAA 


ACACTCTTTG 


CTGTGGTTTG 


AATATTTGTG 


401 


TCCTTCCAAA 


ATTCATGTAG 


AAACCATCTG 


CAATGTTATA 


G T ATTAAG AG 


451 


GAGGGACCCT 


TGGGAGCTGA 


TCAGATCATG 


AAGTCTCCTT 


TCTTATAAAG 


501 


GGGATTAAAA 


GCCTTGGCCC 


TTTTACCCTT 


TGTCCATGTA 


AGG AC AC AG T 


551 


GTTGGAAGCA 


GGGACTGGGT 


TCTCACCAGA 


AAC AG AAC C T 


GCCAGCCTCT 


601 


TGGTCTTGGA 


CTTCTCAGCC 


TCCACAATTG 


TGAGAAATAA 


GTTTCTGTTG 


651 


TTTATAAGTT 


AACCAGTCTC 


AGGTATTTTG 


TAATGGCAGC 


ACAAAGGGGC 


701 


TAAGAAACTG 


TTCTATGCCC 


TAACAAGAAA 


TGTGGTCACT 


TTCCTGAAGG 


751 


AAATGGGGAT 


ATATATAAAG 


ATGTTATATA 


AGACTCGTAA 


TATTTATTTG 


801 


GAAGGCTTGC 


TCTGCAAGCA 


AGGTGGAAGA 


GCAACATGAA 


GGAAGCGTGG 


851 


TGGAGGTGAG 


AGGACTGGAG 


GTTAAGTTGG 


TAGGGAGATA 


CAGGAAAGAA 


901 


GCTTATGACA 


CTTGAGTTAA 


AATGTAGCAT 


CCTTCCTATG 


TGTAGGGCTC 


951 


ATAAAAATGT 


ATAGTCTAAG 


ATAGAACACA 


GAATACTCTA 


TGAATCCTGC 


1001 


CCACAAGGTG 


TTGGTAATCT 


AGATTCACTT 


TTTTTTTCTG 


ATAATGCCAT 


1051 


CCATATGTAT 


GGAGCGTCTA 


CTACTGTATG 


CCAGAGTGAC 


TCTGGAATCG 


1101 


GTTTGGTTGA 


TCTAGACAAG 


ACCATAAGGA 


GAGTCCCCTT 


ACTACCTCTT 


1151 


CTCCAGGGGA 


GGGATTCAAG 


TTG AAC TAG T 


ACTTCAGAGA 


CTGTTTAGTA 


1201 


ATATCATGCA 


TGAAAGGTGA 


TGG TTAGG AC 


AGAAAAATAA 


ATGGATTGCA 


1251 "" 


TCATAATTCC 


TCAGGTTCTC 


CAAATATGTG 


GTGGTCTCAA 


ACCATGTGAA 


1301 


TTGGTCTGCA 


CATCCTGTTT 


GGGTTGCGTG 


TCAGCAGTTG 


AGATCTGAGC 


1351 


CTTATTTGTA 


ACAGTGAAAC 


AG TG AG AG AC 


CTGCCCTTCA 


AGAGCTGTTT 


1401 


TTCAGCTAGG 


AATAGAAAAG 


GGCCAGGCTA 


GACTCCTCTT 


TCTGCTGGAT 


1451 


CTTGCTTCTT 


CTCAGCAATA 


GAAGTAGACC 


TGCCTTCCTA 


GCTGTAGAGA 


1501 


AAAGGTGCCG 


GTAGGCGGGC 


AGGTGAGCCT 


GTGGATAATC 


CTGGAGTAAA 


1551 


GGTTCAATAG 


ACCTTCAAGT 


CTATCCTACA 


GGATTCGGAG 


TGAGGGGAGA 


1601 


GAAAAGGAGA 


CGCTTCTCTG 


GCTGAGAGAG 


GAAGAGAAAA 


AAAAATCCCA 


1651 


GATATCTGAC 


AGCTATATGT 


TCCCATCACC 


ACCTTCCTCT 


AAACCCATGC 


1701 


CTCTCTGTTT 


AGTAGGACAT 


AAAATGAAGA 


GTGACCCACC 


CCCCACCCCC 



Fig. 3 
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1751 
1801 
1851 
1901 
1951 
2001 
2051 
2101 
2151 
2201 
2251 
2301 
2351 
2401 
2451 
2501 
2551 
2601 
2651 
2701 
2751 
2801 
2851 
2901 
2951 
3001 
3051 
3101 
3151 
3201 
3251 
3301 
3351 
3401 
3451 



AGCCCATCCC CCGTTTGTAG GTGTGCTTTC AATGAAAATA AGTCGGTGTT 
CATGGACGGA AAC TAG AG C A GCTGAAAATA GATGCAAGAC TTGTTGAGCA 
TACAAATCAT TTCCCCCTTA GTCTCCAAGG GAGGAAAAAA AATCCCTCTT 
ACTCTCCTTG CAGCCTGTGT TCTGCATTCT GGAGAGGAAG CTGAGGCTGG 
TCCTCAGGCG CTCCTCCCGC CGTTCCCGCA GGAAACTTTT CTCGCAGGGC 
CCGCTCCGTC CATCCCGCGC GGTTCCAAGA CGGTGGGCCT CCCGTGGGCT 
CCTCTCCTGG GCAAGGGCCC AGACCCCGCG ACGCGCCTGT CTCTTTAAAT 
TCCAGCTGCG CGGCTGGGAA ACAGCGCCAC TCGCCGCCCA GGCCGGCTGG 
AGGCTGAAGA GCGAGCTCGC GCTTTCGCTC CCGGCTGCGC GCCGCGGAGA 
GCTGGGCTCG GCCCGCGGGC TGCTAGGTGG CGGCGGCGCG GGGCGGGGAG 
GCGCGGCCCG GCGGAGGAGG GAAGAAAGAG CGAGCCGGGC CGGGAGAGGC 
GCCGCGCCCG GTCCCGCGCC CGGTCCCGCA CCCGCTCTCA GCGGCCCAAG 
CAGTTTCTTT CTGGGTGACA AG AATGTGC C TCGGTTGGTT TTTCTTTTTT 
TTCTCCATCT CCTTAAGACG ATTTCCATAG TAACCTGATC AAGTGGCTCA 
AAATCGCAAA C C TG AG GAT T TCCGCGGCCC GCCGGCAAGA CCTCGGCCAG 
GTAACGCTGC GATCTCCTCC TCTTCCATTG CAAAC CGCTG CGCTCCTTGC 
AAAGTTCCTT TTGTGGAAAA TCGCCCAGCC CAAGGGAGCC CGGGGTATTT 
GCAACAGCGT GTTCATTTCC AGGTGCCTGT CACGGGTCTC CTCCCTGCTG 
CTTCTCCAGG ACCCATGATG AGATTATTTT TAAAAATTGT TTTTGGTCGT 
CTCCCCCGCC CCCTCCCCTT CTTTATTTTT TTCCTCTTCG CTGCACTCTT 
CTCGGCTTTT CCCCTGACAC T AC TG ATGGG GGTGCGGGGG GACGTCGGGG 
ATGGGGGTGG CCAGCGCGGT CCTGGGAGTG GCGGGTTCGG ATGGGCTGGC 
TGCGGTGGGC CACTTTGGGC ATCTCGGCGT GGCCTGCGCC GGGGTCACGG 
GGAGGGCTGT CAGCGCCAGG GCGGCGGAAC CCGAGGTCTC C AG AC G AG TG 
AGGGAGGGAT GCAGGCTTGG GGGTGATGGA GCGCTTGGCT GGTGGCTGGT 
GAGCGTCCAT ACATCATAGC TCTCCTTCCC ACTCCCCCGC CCCTCTTCGG 
GATTCTCTCT TTCTCTTTCC CCGTCCTCAT TTCTTTCTTC CTTTACTCAC 
CACTCGCTTC ATTCTCTTCC TTCCATTTCC TCTTTTTTTC TCCCCTCATT 
TCCTTTTTTT CCTTTCCCTT TTAAAGAAAG GGGAATCGTT TGTAACCCTT 
TCGTTCTACC AACGTGGAAT AGCTGTGAAA CCTGCAGCGT GGTCACCTCA 
GCCTGGTCGT TTTCAGACCC GTCCTCATCC ATCAACATAT TTG TT TCCCG 
AGTCTATTGA TCTCCCTGAA TTCTACAGAA ATGCATTCTA AGCTAGGCGC 
CTGTATGTCA GAATCAGTTC TGCAGGTAGC TTCCGTGCTC CAAGTATGAC 
ATGTATTGTA AGGGCTGCAT CTGTTTTAAA CCCACATAAG CCATGGGTAT 
AAATAAATGT AGCTTTGAAA AAAAATCTGG CCTTATTCTA GATAAACTTC 



Fig. 3 



(cont'd 1) 
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3 5 01 CCTCTTAAAT TACTGATATA CTCTTCTCCC TCTTTGACAT TTAATTTTAG 
3 551 G AAAG TTGGG AGACAGGTTC TTGTCCTCCA GTTTTTAAGG AGCAGGCAAC 
3 601 TTCTATTATC TTAATTTTCT CGTCTTTGAA CATCACTCAC GTTTGCACTA 
3 6 51 CCCAGTCAGT GGAACGAGTG GGTCATAATT AA 



Fig. 3 (cont ' d 2) 



O qii W ;;l H S H^Q^ 



54/124 

Human genomic sequence 

1 CCTGCATTAT TGTTTTTATC TGACTTCCAA 

51 GTGGGTTTTC CTGACACATT TACAAGATGC 

101 TTTGAAGGCA CATTTAATTG TAGGTGCAAT 

151 TTCTTGGTTT GAGATGTCAT GCCCTTTTGG 

2 01 CTGTGTGTGT GTGTGTATGT GTTTGTGTGA 
251 CTAACTGTCA TGTGATTTGA AGTTAAAAGG 

3 01 AATTTCTCTT ATTT AT AG T A CATTGATCCT 
3 51 GATTTCTTCT GTGCATGGAT C ATTT AAC G A 
401 ATTTTTTTAT AATAGGAAAA AAATTCCTCA 
451 TTTTTATACA GAGGGATTAG CAAATATTTC 
5 01 CTGAATTGCT GACCACTGCT AATTACCAAG 
551 AAATTGCTGT ACCTCTCAAG TTGTCTGGAG 
601 GTAACTCATG GCAACAGGAA GTGGTTGTTC 
651 CACATGGACC CGTACTTTGT TAGCACTCGG 
7 01 TAATGTACTG GCTTTTTTGT ATAGATGAAT 
7 51 CAGAACTACA TCTCCCAAGC CTTGTTTTGC 
801 ... T C AC TT AC AG TAAGTCCTTA CTT AAC TG AC 
851 GCAACTTTAA GCAAAAGGAA GTATAATGAA 
901 TTGG TAG AAA CAAGACTTAA GTTCCCATGG 
9 51 CATTTCCAAA CTTCTCAAAA CACTTCAATA 

1001 AAAC TATGT A TATATGTAAG AAAGGTTACT 

1051 CCCAATTATT TAAGTTCAGG GTCTTAGGTG 

1101 CTC AGGGC AC AAGGCGGG AA CCAGCCCTAG 

1151 AGGGC ACGTT CACACATGCC CACACGCAGG 

12 01 CAATTCACCT ACCATGCACA TCTTTGAGAC 
1251 CTGGAGAAAA TCCATACAGA TATGGGGAGA 

13 01 CAGTGGACCC AGC CAGGAAT CAACATTTGG 
1351 AGTTGAATGA AATGATGTCG TTCCACGACC 
1401 ATAAAATTCT CAGAAGACAG AGGTTTAATG 
1451 AACTTATAGA GAAGTGTGCA C ATGTG AC TT 
1501 AGGATGAGAA TCAGACGTAA GAGGTGGTGC 

15 51 TTCAGGGAGC TGTGGATGGA AGTAGAAGCC 
1601 CTACTTCTTA CTGTTTCAGT TATTGCAGTG 

16 51 TGTCACCTAG CTTTGTTTTC TCAAGAAGAA 

17 01 GTTATGAATT CTT AAAC AC A CAGGACATAA 

Fig. 4 



TTTTGGTGTT CCCTGGGTGG 
TTTTGGCAGG TTGGCTGGAA 
AAAAT ATTC A TTTTCTCTTG 
TCACTTATAT TTTGGTGTGA 
AGGATTTAAC AAAGTCTGTT 
TATGTTAGTG ACAAGCCACA 
GAAACCATTT TTTCCCTTGT" 
AAGGTTGGCA ATGATGAGCT 
AGTTT AC TT A CCAAGTCATA 
TGATCTAATA TTTTAA TAG A 
AATATATTTT CTTAATTCTG 
GACTCCAAGT GACCCAACTT 
TGGGTGCAAG CTGAAGTGTG 
GGACTTGATA TGGAAAGAAT 
GTTAAC TTTC TGACATTAGT 
AGTGTCTGTC CCTTTGCTCT 
TTGATAGGTT CTTGGAAACT 
ACACTTTTAT CACAGGCTAA 
CATATTTCTG GTCACAAAAA 
TTAAGCATTC AAATACATGT 
ATAAACCAGA TCAATATTTA 
GCTGGAGCCT ATCCGAGTAG 
ACAGGACACC ATCCTGTTGC 
CTGGGAC CAT TTACATGTGC 
GTGGCAGGAA GCAAGAGTAC 
ATGTACAAAC TCCACCCAGA 
GCAACATTAT AATGAAACGA 
TGCTGTACTT GAGGGGTGTT 
CTATCTTTTT AATAGAAAAT 
TGTGTGTAGC AGGAATCATT 
CAACATGAGG AATGTTGAGA 
AGAAGGCCAG GGTTAGGTTC 
TTGGCCTGTT TATTCACAGA 
AAATGAGCAT AATCTTTCCT 
CCACAGACAC AGAGGTGCAC 



55/ 124 



1751 


ATATGTAGCA 


GTAATGGATA 


CTAAATGATA 


CACTCGGAGG 


AAACAGAAAA 


1801 


GACTTCTGAA 


T AG AG AC TG G 


AGATACTTCC 


TTGGACCATT 


GATGAATGGG 


1851 


CAATGATGCA 


TTTTTGTCTT 


CCATTCAGAA 


GGCTAATATA 


TTGCTCTCTA 


1901 


TGTTCTATGG 


ATAAAGGCAG 


TATATGCTCA 


AGGATGAATC 


ACATAATATG 


1951 


CATAATAAAT 


CCAGCAAGCA 


TTACCCTTTT 


ACTTATGTGA 


CTGCAAGTAG 


2001 


GAATACATTT 


CCCCCACTCT 


AACCATGTAA 


GATTTCTTTC 


CCTTCTCCCA 


2051 


TTTTGTAAGC 


AAAAGTAAGT 


TCCTGAAAGG 


TTAAATGGAC 


CTCAGGATGG 


2101 


GAAAAATCCC 


CAGAGCTATC 


TTTCTGCACA 


GACTTCATTT 


TTTCTCCCAA 


2151 


GTCTGACTGT 


CAACTGCGAT 


ATCTGATATG 


AGGCTCTGGT 


GCTGATGTTT 


2201 


CCATAGGTCA 


TCATCCTTCG 


GTGTCCCAGA 


TGAAGTCTCA 


GGTCGAACAT 


2251 


TGCAATAGCA 


CAGATTCTGA 


ATTTAATG C A 


TCATTAAAGT 


TGGTTATGTA 


2301 


ACCCAATGGC 


CTTGTTAAAC 


TCCAGATTTT 


TAAAATTATA 


TGTATTTACT 


2351 


ATTCTCTTAT 


TTTAGAATGA 


TCTCACAATG 


TTCACAAGAA 


ATAAG CCC AG 


2401 


TCCCTGCAAA 


GACTTTAAAA 


GCTGCTTGTT 


C A CATC ATT A 


GATTGTACAA 


2451 


CGCTTGTACA 


ATGACACTTT 


TTGCTAATCT 


ATGCAACATT 


TTTGTAACAA 


2501 


TTGTGCACAT 


TTTAACTACT 


TCAGATAATC 


AGGACC T AG A 


GACTTCAAGA 


2551 


TCTGGAAGCA 


TTGCTGGTGA 


CATAGAGCAA 


AAACTTTCTT 


GAGAATAGGA 


2601 


AGTCAGTGTT 


TTGACAAGTG 


ATTTATAACA 


GTTCAGGTAT 


AGCCAGGAAG 


2651 


GTTTGAAACA 


AACCTTAAGT 


ATTATTTCTT 


TCATCTTGAT 


TAG TATAT AT 


2701 


TTATATGTGA 


TCTATTTATG 


TATATTAATA 


GATTTTTGGG 


TCTTATAGCC 


2751 


AGCTTTCATT 


TTTCTCTATT 


GGAAAAGATC 


TAAGTCCCCA 


TCCTTCCTTG 


2801 


GTGGCTTTTG 


GTAGGTTTGT 


AGACAAAACA 


TTGAAGAATC 


AATGGTACCT 


2851 


TTTATACATT 


AATACTGCCA 


ATATG AC C AT 


AAAATCATAT 


TTTTTGGGAA 


2901 


TTTATTCCCC 


CGATCAAAAG 


AAGCATTTGT 


TATTGAACAC 


AGTCTTATGC 


2951 


TACCTTATTA 


AGATGTATCA 


AACACCCTGA 


TTGATCAAAA 


ACACCTCAGT 


3001 


CCATTTTAAG 


GCAGTATTGC 


CCAGCAATTA 


AAGATGTAGC 


TTCTGGAGGA 


3051 


GTCTTTCTGA 


GTTTGAATTC 


AGTACTCTTC 


CACGTACTAT 


ATAGGTGATC 


3101 


TTGGGTAAAC 


TTCTTGAGTC 


TCAGTATCCC 


CATCTGTAAA 


ATTGTTGTAG 


3151 


AGAAGAATTT 


TTGTGATGAT 


TAGGTGAGAG 


AATATATTAA 


TGTAATATTT 


3201 


AGGAGAGCAA 


CCAGCATGTA 


GCATATATTC 


ATTACATATC 


AATTTCTATA 


3251 


TTATTGATGT 


TCATACTGCT 


GATGTTGAAA 


TGCACAGGAA 


GGCCACAGTT 


3301 


ATTTTCTGTT 


TAGATTGATT 


TTTCTTTTAA 


AGTCTGAACA 


TAAACTGTAA 


3351 


TACTGTGCTT 


ATTTATGTAG 


GAACTGTGAT 


CTCGTCTCCT 


CCTTTTCCCA 


3401 


TCTCCCCCTC 


TCTACCTTAG 


TTTTTCCTTA 


TAGTCTCAAG 


CTGAAAACAA 


3451 


TGACCAGGTG 


CCTAAGAGAT 


AAGAATACTC 


TTTCTTTTGA 


ACTCATGGCA 






Fig. 4 


( cont ' d 
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7 ^ m 

j JUI 






J. 1 VJuAVJUV. X 


A TT A "PTT^T 1 a A 


\j i vj/irW-v i ^ttAj^ 


■jcci 




ASAAPfA AP,P 




PT"P A PTT A ma 
V— 1 J. 1 


nb 1 1 


JOUl 






PAT A APAATC5 


APAPAAPAP& 


till 


3651 


TTGGGGAAAG 


GGTGGGAAGG 


GGGTGAGGGA 


TAAAAGACTA 


CAAATAGGGT 


3701 


GC AG TG TATA 


CTGCTTGGGT 


GGTGGGTGCA 


CCAAAATCTC 


ACAAATCACC 


3751 


ACCAAAGAAC 


TTACTCATGT 


AACCAAACAC 


CACCTGTTCC 


CCAGTAACCT 


3801 


ATGGATATAA 


AAAAATTAAA 


AAAAAGAAAA 


AAAGAAAACT 


CTTTTTTGCA 


3851 


GGGGGCAGGT 


AAAGGGTAAG 


AGGGCATCCC 


ATTTTTGAGT 


TTCTAGAAAA- 


3901 


GCTT 











Fig. 4 (cont'd 2) 



57/ 124 

Human genomic sequence 



1 


CTGCAGGAAG 


CAGCAGCAAG 


GTCCAGGGAG 


CCTCTAATTT 


AAATAGGAGA 


51 


AGTCAGAGCT 


TTAACAGCAT 


TGACAAAAAC 


AAGCCTCCAA 


ATTATGCAAA 


101 


TGGAAACGAA 


AAAGGTAAGT 


GTTTGTTACA 


TCATTATGAC 


ACAAGTCCAA 


151 


CATGAGTCTT 


GTGAATTGCA 


TGCTAAATCT 


AATATTTGAG 


CAGCGTAACA 


201 


ACTTTGGGCC 


TAGAGATGTT 


ATC AG TGG AG 


TTTCTTTATG 


TTTCCTAACT 


251 


GTCCCCTCCT 


GACTGCCAGC 


TTTCTTATCT 


GAAGAACATT 


TTAAACAAAT 


301 


AAACTCATTC 


ATTTTAAAGT 


AGTTAGTTAT 


ATATGCAAGT 


ACAAATACTG 


351 


TTTCTCAAAA 


ACAGGTCCTT 


CCAAATGCAT 


GTAAATCACA 


TTTTCTTATG 


401 


TCTTTTTATG 


TTTTTGAAAA 


TGTATCCTGA 


AATCATAAAG 


CCATATTGAA 


451 


TTTATCTGAA 


TCCTTAACTT 


CAGTTAAGGT 


AAGAGCCATA 


AGTGTTTTTG 


501 


ACAATTAAGG 


TTGGAGCATC 


AAAATTTGAA 


ACATAATTAC 


AGTAGGTTTT 


551 


TATCTTTGCA 


AGCAGCAGAT 


CCCAGAGATA 


TTATGACCTC 


AGTTTTCCCC 


601 


AAAAGACAAA 


TTATTCATAT 


TTGTTTTGTT 


TTCTTGAATT 


AG TGC AT AAT 


651 


ATAAATATCA 


AATCACAAAA 


TCAAGGACAT 


TAAATGAAAG 


TGTCTGTTAA 


701 


AGGCATATTA 


TAAATGAATC 


ATAAGCCACA 


CAGTTCTCTG 


TGATGTACGA 


751 


AGTGGGCATT 


TAAAGAGGTG 


CTGATTTGAT 


GCTTGTCACT 


GAGTAGCAGA 


801 


GAGGACGGGG 


ATGAGTATGT 


G T AGTTTAC A 


CCTCAATCAT 


GAGGAAGTGA 


851 


AGAACTTGTG 


CTGTTATAAG 


TAG TATGGC T 


GTGTGAGGAA 


CTAGGGTGTT 


901 


CTGCTGGATT 


TTGAGGAAGT 


ATTTTCAAAT 


CAATAGAACT 


TCAAACTTTT 


951 


CTTCAGAGTG 


TTGGGCTCTA 


CATGGAAAAA 


CACATGAAAT 


TAAAAAGTGG 


1001 


CACAAATGTT 


TAGTTAGTAG 


AACATCTGGC 


TAATTGGGAT 


CAAATAATTC 


1051 


AACCATGTGG 


GAACGTTTTT 


GCTCAAAATA 


GATAATTGTG 


AATTGTTTCA 


1101 


TATAGGCAAA 


TGATTAGACA 


ACTTCCTCTT 


CCTCAAATGT 


GAACGGACAG 


1151 


ATGTGATCTA 


GAAGCAAGAC 


ACTCTTTTGT 


GTAAATATTC 


CCTTTGGCCT 


1201 


AAAGCAAAAG 


TGG AC AG AC T 


TTAAACACCT 


GAGAGCAGAG 


CAGTGTGTGT 


1251 


TAAGATTGCA 


ATATCTTAAG 


CTCTTGAGTT 


AAATGGAAAA 


TGAAAAACAA 


1301 


AAGTGTATAT 


TTGGAAGTTA 


GGAATGTTTT 


CTTTAAAATA 


TAAAATAAAA 


1351 


TTTTAGATTT 


AAGATCACAA 


GAAATATTAC 


TGAAGACTTA 


TACTCTTCCT 


1401 


GGGGCTAAGG 


GAGGTGACAG 


TCGCTCATCA 


GAAAAAAAAA 


AATGCCCTCA 


1451 


TTTCCTAACT 


TTTCTAAAAA 


ATATAATACA 


AGTTCAGGCT 


AATACTTCCT 


1501 


GTATATGTGG 


GAAATTTCTA 


GGGGAAGCTA 


ACAGGCTTAG 


AAATAAAGAT 


1551 


GTGTTAAATA 


GACTACCAAA 


GTGTCCAATT 


AAGCAACACG 


ATACCACCGT 


1601 


TATTGATATT 


CTAGCAAGAA 


ATTACTAGCA 


ATGTTTGTAA 


AT AG AC TT AG 


1651 


AAATGCATTT 


GATGAATTAA 


CACTTTTATA 


TCTTAATTTA 


TCTGAATTTT 


1701 


TCTGTAATGT 


GAAAATGTTT 

Fig. 


TATTTAACTT 

5 


ATTTCTGGCA 


T C T ATT AG T A 
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1751 


AAATTCTGAT 


GAT AT AC AAG 


CATTAATATT 


TTTCCATGGC 


CACTCAATTC 


1801 


AT AC AT AC C T 


TCCCTATCTA 


TGCTTAGAAG 


GCAGTGCAAA 


ATTAGATAGT 


1851 


AGCAATATTG 


ATT AT AACC A 


CAAGGTGGAG 


ACAGATGTCA 


TGTAATATGC 


1901 


AGTCTGCTCA 


TATAAAGCAC 


ATTTTCTTAG 


ACAAGAGTTT 


TCATACGATA 


1951 


TAATAAAGAC 


ATCTGGAATT 


TGTCTTGTAT 


GCAATATGAA 


ATTTGCTATT 


2001 


AAACGTGGAG 


TTAAAACTTT 


ATGTCAATAG 


ATCCAATAAC 


AATGTTCATA 


2051 


AATTAATCAT 


TATGTCATGC 


TGTATTTCCA 


AAATACTATC 


TTAAATTATA 


2101 


AGAGCAAACG 


AGGTAATAA 









Fig . 5 (cont 1 d) 
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Human genomic sequence 



1 


GTACATTTTT 


TAATAAAGAT 


GTTTG TTTT A 


AC TTTTTG AA 


TATGAAGATT 


51 


TCTAGTTCTA 


GAATAATGTT 


TATAAAAATA 


TAC AAATC C A 


TC TGG TG ATG 


101 


AGTTGACCTC 


TATCACAACT 


AGTTTGCATA 


TATAACTTGG 


GTG TG AC C AA 


151 


GCAAGGTGAG 


AGTTAAGAAC 


TTTTAAAACT 


TACTGTATTA 


TATTGATAGA 


201 


ACTCAGAAAG 


TACTAACTTG 


AATATTATTA 


TTCTAATTGC 


TTTTCCCTTT 


251 


TAG TTATTAA 


AAATAAGAAT 


ACTTAAATTA 


ATAA CAAGAT 


CTTTTACTGG 


301 


CAGGATTAAC 


CAAATTATCT 


GTAATGTGTT 


CCTCGAATGC 


TTTTAAG TGG 


351 


AAATATACTT 


TATACATTCT 


TTAACAACTC 


TGAGAGGATG 


AGTTACATAA 


401 


ATCAGTTCAG 


G AATC T AT AG 


AATCTGTAAT 


AC AT AG TAAA 


GGTTTATTCA 


451 


CAATTAAAAC 


AATTTC AC TT 


C TAT ATT AAA 


AAAACAAATT 


GTTGAAAGTA 


501 


CAGTGGCTTT 


TCATATGTAT 


GATTTGTAAA 


ACAAATTAGC 


TTTTTTAAAG 


551 


TG ATGTG AC G 


CTTAATGAGA 


AGAAATCAGT 


AGAGAATTAC 


AAACTGCACT 


601 


TCAAAAGATA 


CATC T AATAT 


CATTTTAATA 


ATGAAATTTG 


AAAAAATAGT 


651 


GTGCTCGTTT 


TACAGTCTCA 


TTAAATGAAT 


TAAAATATCA 


GCACACATTG 


701 


TAGTAGGTTA 


TCATTGGCAG 


AGAAGGCTGA 


AATAGAAACG 


TTACAATGGG 


751 


ATGCACTGCC 


ATCTGAACAT 


TATGTCGAAG 


TGGAACGCGG 


AAACATATTT 


801 


CTCAGAACAA 


GTGGTAAAAT 


GAAAACAGCA 


TCATTTGTAA 


AGCATTTCTT 


851 


TTGAGAGTGC 


TTC AG TTTCT 


TCTCCTGATG 


ACCTGCC ATT 


CAGAAACTGA 


901 


CAATGAATAA 


TACACTCTGA 


CACCAGCATT 


TGTCAATTTG 


CCCAGAACCA 


951 


TATGAG AG T A 


CTCTAGACAG 


ATATATGTTC 


CGAAGTAAAC 


CGAATACCTG 


1001 


TTAACTGTAA 


ATCAAATCTT 


GTAGAAACCA 


TGCCATGGTT 


C C TTTGG AC A 


1051 


TAT AC TTTG C 


ATGCCTGAAG 


CAAGTTACCT 


TAAGAAATCA 


TTCTTTTGTT 


1101 


TTACAAAACT 


TGTATTAAAA 


AATTAAAAAT 


GCAAAAAAGC 


TTAATATTAT 


1151 


TAGGAATTTA 


TCCATAGCTT 


TATTTGGAAT 


CCAGTTTCTT 


TATTATGATC 


1201 


TATAAACATG 


CATCATTTGA 


TGGAGTTCCT 


TAG TGG AG AG 


GTGTTTTTCC 


1251 


ATGTTGCTAA 


GAAACATGCC 


CCAGCACCAG 


AAGGGATACT 


ACCTACCATC 


1301 


TTTTTGCCAT 


TTCTCACCGT 


GATTCTTACA 


TTGTACCTGT 


TT AC TC AC TG 


1351 


AACAGGGCTT 


CCTTCTCTTT 


GTC TAGATTC 


TAATCAGGTG 


TCTTCTGGTG 


1401 


TGGAAGCTTT 


GGCTTTTATT 


TACACACAAC 


ACAGAATTAA 


TAAGATAGAT 


1451 


GCCAAGGATT 


TAGCAACATT 


TTAATTCAAC 


ATTATACAGG 


TATCAGAGTT 


1501 


AATGAGAATT 


ATGCATTAGT 


CTTTAAATTT 


GGGCAGCTTA 


TTCAGCTAAA 


1551 


ACATAGATGT 


CTAGCTCTTA 


AACACTTTGT 


TTTTTTAATT 


ACTCTGAAAT 


1601 


TACAATAAAG 


TCAAAGAACT 


GAACTGTTTT 


CTTTTCAAGC 


CAGTGCAAAT 


1651 


GTGCTTTAGT 


TATTATTTTA 


CTGGTGATCT 


AATTATGCAT 


TTTAATGCTT 



Fig. 6 
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17 01 


T ATT AC TT AA 


T AC T TAT AT A 


AGCCTAAAAT 


ACGTTGTTAA 


TGTCATAATT 


1751 


TCAGGGATTT 


TAG TATTCTT 


TCCATGAGTT 


ACCATAACTA 


GGTGCATATG 


1801 


TGTAAATATA 


CGTATATATC 


TATATCTATA 


TATTTATATC 


TATGTATATA 


1851 


TC AATTT AT A 


AGACTAAATA 


GACTTGGCCA 


TATGTGTTGT 


TGGTTTATGC 


1901 


ATACATGCAC 


AAATATTGAG 


GTGTCCACAA 


AGTATATATG 


C C TG T AC AT A 


1951 


AATTACATAC 


TGGCTGGTGA 


GTGAATGTAA 


GCTTCTCTAA 


ATTGTACAAC 


2001 


TCTCCACAGA 


GTGGCACTCT 


AATATTGCAA 


AGGTACAATA 


TAAGCATGTG 


2051 


CAGAATGAAC 


AGCTCTTCTA 


GGATCCCTAT 


AAAAC TCC AC 


CCCATGTTTC 


2101 


TGT 











Fig. 6 (cont'd) 
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Human genomic sequence 



1 


AAGCTTCATC 


CCAGAGGGGC 


ACTTGCCAGA 


TGCCTGCTAG 


AGCTCTCCTG 


51 


TATGAGGAGT 


CTATCAACAC 


CTGCTGGGAG 


GTGTCTCCTC 


GTCAGGAGGC 


101 


ACGGGGGTCA 


GGGACCCACT 


TGAGGAGGCT 


GTCTGTCCCT 


TAGCGGAGCT 


151 


AGAACACTGT 


GCTCGGAGAT 


CCGCTGCTCT 


CTTCAGAGCT 


GGCAGGCAAG 


201 


AG TG TTTTAG 


TCTGCTGAGC 


CTGCGCCCAC 


AGCCGCCCCT 


TCCCCCAGGT 


z. z> ± 


GP Tf TG TP C C 


AGGGAGATGA 


GAGTTTTATC 


TGTAAGCCCC 


TGACTGGGGC 


"I A1 

J> \J A. 


TfiPTAPPTTT 


CTTTCAGATA 


TGCCCCGCCC 


AGAGAGGAGG 


AATCTAGAGA 


•5 CI 
J Jl 


X L, 1 VjVj 


PTAPAGPAGP 


TTTGCCAAGP 


TGCAGTGGGC 


TPTGPPP AGT 


*± U ± 


t-LAAnAI ILL. 


p a nrnnnTTT 

L.fHjLV7lJU XXX 


PTTTAP ATTH 

U i J. i ALft X X \J 


TP A PGPP AAA 


APPAPPTAPT 






rnrn t\ rpfZCir ' 7\ PT 1 
X 1 Ai V-rV^ X 


X uv, V- 1 — X \_ v_ v_ 


PPPAPP AAPP 


TPP A PPPTPP 


5 01 


r> t\ r^r~* T l /"" ,r P('~'f~ ,r P 

C ACslj 1 (j X L,L- 1 


1 L,/-lv_7/-i^_ X Vj\— X 


PTPPTPPP A A 


TPAPA ATTTP 


A AP.PP APTPP 


DDI 


.ft. X L, X lAvjLl X 


P.PTGGGPTPP 


A P A GGGG TGG 


GATPPAPTGA 


GPTAGAPPAP 


Oux 


1 X jHL^L, 1 


GGPTTPAGPP 


CCCTTTCCAG 


GTG AG TGG AT 


GGTTCTGTCT 


651 


P APTGGP ATT 


CCAGGTGCTA 


CTGGGGTATG 


AAAAAAAAAA 


CTCCTGCAGC 


7 01 


TAGPTTGGTG 


TCTGCCCAGT 


TTTGTGCTTG 


AAACTCAGGC 


CCTTGGTGGT 


7 51 


GTGGAPAPPP 


AATGGAATCT 


CCTGGTGTG C 


ATGTTGTGAA 


GACTGTGGGA 


8 01 


A A AGP AT AGT 


ATCTGGGCTG 


GATAGC TC CG 


TCCTTCAAGG 


CACAGTCCCT 


851 


P ATGAPTTPP 


C TTGG CTAGG 


GG AGGG AG TT 


CCCCAACCCT 


TTGC A C T TC C 


9 01 


CAGGTGAGGC 


AACACCCCAC 


CCTGCTTCTG 


CTCACCCTCT 


GTGGGCTGCA 


951 


CCCACTGTCT 


AATCAGTCAC 


TGTGAGATGA 


GCCTGGTACC 


TCAGTTGGAA 


1001 


ATGCAGAAAT 


CACCTGCCTT 


C TG TGTTG AT 


CTCACTGGGA 


GCAGCAGACT 


1051 


GGAGCTGTTC 


CTATTCAGCC 


ATCTTTCTCA 


GGTCATAATC 


ATAGATTTTT 


1101 


AATTGATCCC 


AGCAACATGG 


ATTAGTAAAC 


AGCATATTTC 


CAAGTGATTT 


1151 


T"T"TTTT A TTT 
x x x x x iai x x 


TAAGG TC AAA 


TCTACAAAAT 


ATTATAGTGT 


TATCACCACT 


1201 


T AAAATT AT T 


ACTGGTGATA 


CTATGTTTGT 


CTCTATTCAC 


ATTTTATTGC 


1251 


TAGAAAGAAT 


TATAATTTGT 


AG A TAATAAT 


AGTTATTTGA 


AATGTATTAC 


1301 


ATATCCTTTT 


ACTTTTAAGA 


AG AGGTG A C T 


TAATTATCTA 


GGTATACAAT 


1351 


TATTTTGAGG 


ATACTAAATG 


TCATGAATAG 


CAAATTTATC 


ATATTGCTTT 


1401 


CCTAGGTGAA 


GACCCTGAAA 


CAAGAAGAAT 


GAGAACAGTT 


AAAAACATAG 


1451 


C AG AC TTG AG 


GCAGAATTTA 


GAAGAGACTA 


TGTCCAGTCT 


TC G TGG G AC T 


1501 


CAGATAAGCC 


ACAGGTTTTT 


TTCAATTTTG 


CATATATTTG 


AGCCAATAAA 


1551 


GAAAAAATAA 


TTACAAACAA 


ACATTTAACT 


TTTCTTATAA 


TG AC AG AG AT 


1601 


GGGATTTCAG 


TTTCCCCTTA 


CTATTTTCTC 


CCTTGTTTTA 


TATCAAATTG 


1651 


ATTGGTAATT 


ATCCTTAAAC 


TGAGAATTCA 


C AG TAT AT AC 


CTATTTATCT 


1701 


TTTATCTCTA 


TCTCTATCTG 


CTATTTATGT 


CTTTTTCAGT 


ATAATTTCCA 
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1751 


GTACTGCAAC 


TACCACCATC 


ACTGTTAAGT 


GGATTTGTAA 


TACCTGTCCT 


1801 


AGAAAACAGT 


GGCACAAGTT 


GCACTTGAAA 


TGCATCTGGG 


CAGGGTAGTA 


1851 


GGGAGACATT 


CAAACATAAT 


TGTAGTTAAC 


TTTCAGAATA 


GGTCTGGGAA 


1901 


GGTTACAGTG 


AGTTAAGGAT 


TTGTTGAAAA 


TGTAAAACAA 


TATGTTGTTT 


1951 


TACCCAAGGT 


GTACTGATGG 


CCTTTCTTTT 


GAAAACAAAC 


GAAAAGCTAT 


2001 


AAAATG TATG 


CCCCTTTCCA 


C AATTTG AC C 


TCAAAATGAA 


TATAGAGTTT 


2051 


AGCTTTCGGG 


AAGATGACGT 


GTTTATAAGA 


GATGACCCTC 


AACTCCAGCC 


2101 


TTTTCTGTCT 


TCATGCATTC 


TAGATTATGG 


CCCTAAGTGA 


AC C AG AG TAT 


2151 


AGTTATTTCT 


CCATTTTATT 


TGACAGCACC 


CTGGAGACAA 


CATTTGACAG 


2201 


CACTGTGACA 


ACAGAAGTTA 


ATGGAAGGAC 


CATACCCAAC 


TTGACAAGTC 


2251 


GACCCACCCC 


CATGACCTGG 


AGGTTGGGCC 


AGGCATGTCC 


GCGACTTCAG 


2301 


GCGGGAGATG 


CTCCCTCCCT 


GGGTGCTGGC 


TATCCTCGCA 


GTGGTACCAG 


2351 


TCGATTCATC 


CACACAGACC 


CCTCGAGGTT 


CATGTATACC 


ACGCCTCTCC 


2401 


GTCGAGCTGC 


TGTCTCTAGG 


CTGGGAAACA 


TG TC AC AG AT 


TGACATGAGT 


2451 


GAGAAAGCAA 


GCAGTGACCT 


GGACATGTCT 


TCTGAGGTCG 


ATGTGGGTGG 


2501 


ATATATGAGT 


GATGGTGATA 


TCCTTGGGAA 


AAGTCTCAGG 


ACTGATGACA 


2551 


TCAACAGTGG 


GTAAGTAACC 


CTGTTCTCCG 


TCAGCATTGT 


GTGAAGAGGG 


2601 


GAGGTGGTCT 


ACTATAATGC 


ATTC AC TATA 


AACAAATGTG 


TAAGTTTGCC 


2651 


CAGAAAGTCA 


TGAGAACATA 


TGAGATATCT 


GAGGTTATTC 


AGAGTGTTGA 


2701 


AGGGCCCTTC 


CTCTGCTCAT 


TCATGGAGAG 


TAAAGAATCC 


AAGATTTCTA 


2751 


TAAATTCATT 


ATAAGCCGCT 


AAGTTTTTCT 


GTTGTTGAGA 


GAAACACATG 


2801 


TGGCTTCTGT 


TTTTC AG AG T 


GATTTTCACA 


TGCTTCTTAA 


G T AAC AG ATT 


2851 


TTGTAGTTAA 


GGACGTGGGA 


AGGAGACAGG 


AGG AG TTTTG 


CTGATTTGCT 


2901 


TGATTTTTTT 


TTTCTTTTTT 


AGCTTGTTAG 


AAGCGGCCTG 


TAACTGCTTT 


2951 


GAGAAACAAA 


TATTTTCTTA 


CTGTCTTCAA 


TTATGCATCC 


CCAATTTAAC 


3001 


TTGAGGGAAA 


AATCACTTTG 


GAGTTGAAAG 


TTTCACTCTA 


TTCATTTTCT 


3051 


TTTGATGGTA 


TCAGATTTCA 


ATACATCTCA 


GACCCTGTTT 


TTCTTCTGTG 


3101 


TCCTATTACA 


TTCCAAAACA 


TGTTGTGATT 


GTAAAACTCT 


TAGAGTATAT 


3151 


TAACAATTTG 


GGATATTTGG 


CATAATCAGA 


GAATAGGTCC 


AAAAGGAGGC 


3201 


AATAGGATAT 


TCTATTAATA 


ATTGTAATTG 


CCATTTTTAG 


CATTTCCTGT 


3251 


TATG TACTAT 


GCTCTTGTCA 


AGTGCTTTGA 


AGATAGTGTT 


TTACTTTTCC 


3301 


TTCCCACCAC 


CAGCAATGTT 


TATGAGGTAG 


ATGTTTTTAT 


ACATGTTCTA 


3351 


TGGATAAGGA 


AACTGAGTCT 


AATTGGCCCC 


GGCTGGGAAC 


TAACGCTAGG 


3401 


GAAACGGCAG 


ACCTGCATTA 


GAACTCAGCT 


ATGTCTGACT 


TCAAACACAG 


3451 


GCTCAGTAAT 


ATGTGGAAAA 


GCTTCCCAAT 


TAACTTTGTC 


TATAAACTTT 
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3501 


GTGTGAGTCT 


GGATTTTGAC 


TTACTCTTTG 


TCTTTACGCA 


TCTGAGAGGA 


3551 


CCCATGTAGG 


AAATAATTCT 


TCTATATAAG 


TGACCCTTCC 


TGACTTCATT 


3601 


CATGAAAAGC 


TTATGTTTGA 


AGGGTGACAC 


GACCTAAAAA 


AGAGTACAAA 


3651 


ATAGCTTTTG 


ATT AC ATTT A 


TAGCTTTGCT 


CTGATATCCT 


AATACCTACT 


3701 


AGTCCATTCC 


TGGTATC C AC 


CCTACCTGAC 


TTTCTAAAAA 


TTTAGAATTA 


3751 


TAG AGAC T AA 


TTATGATTAA 


TTAAGATAGG 


TTGTTGTTCA 


GTTGCCACTG 


3801 


GATTCAGAGT 


GCCTAGTTTG 


AATCTCTCCC 


ATTCACTATC 


TGTGGACCCC 


3 851 


TTCGG AAC C T 


AACGTATCCA 


AATTAGTTTT 


TGTC ATC TAG 


AATAAGGATA 


3901 


AAATTGTACC 


ATCTTCATGA 


AGTTGTTAGG 


ATCATCCACA 


AATTTTAGTT 


3951 


TGCGCAATGC 


TTGGCATGAT 


ACAAGCACTC 


AATAAATTTA 


TCATCTTCCT 


4001 


CTTTATCATC 


ACTATTACAT 


TTATTATCAT 


TAATAACCAT 


ACCAATTTTT 


4051 


GGTTGTTGTT 


AGTTATAATT 


ATCATTTTTG 


TATGTATTTA 


ACATAGCCTA 


4101 


GGAGGCAATG 


CCCAGTTCAG 


AAAACATAAT 


GGCAAAGCAA 


GAGTGTCTAA 


4151 


GGCACACTCT 


TTCTCCCATC 


TCTCTCTTCT 


TTCTTCTCCA 


TTCTTTCCAC 


4201 


TCTATCCCCT 


CTTCTCTTTT 


TTTTC TCAAT 


CTCCTTAGAT 


GTGGACATAT 


4251 


GTGTGAATTC 
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Human genomic sequence 



1 


TG TGGGTGTG 


GGTGTGAAGC 


ATGTGTATGT 


GTGTGTGTGA 


AGCATCTCCC 


51 


CACCTGTAAT 


GTAAGTCCAT 


GAGTGCAGAA 


TTTTTGACAT 


ATTCTTTACG 


101 


TGTTGAGTTT 


TAACAAATGT 


TTGTGGAGTG 


AATGAACAAA 


TTAATGAATA 


151 


TAGGCTATTT 


ATTAATTAGG 


CAATATAGTC 


ACATAGGCTG 


GCAATCGCAT 


201 


CTAATTAAAT 


AGAGTGGTAA 


ATGAGTTCCA 


GAAAGAACTA 


AGGTACTACA 


251 


AGGATG TTAT 


GAAAGAGAAA 


AATGAGTTAT 


GTGAAAAATA 


GGAGACAGTG 


301 


ATAAGAGGGA 


AAGAATCCCA 


AAGTGTGGGC 


CACATTTTGA 


AACTAATGAC 


351 


CTATTATTCT 


ATTATTGTTA 


GCTGAAAGTA 


GAAAACGTCA 


TGGGAGGGAA 


401 


TATCTGCTAG 


TTTTTGGTAA 


AGGATGTTGT 


GATGGCAGAA 


CCAAGAAATG 


451 


AACACAAGGT 


GACTTTGGTT 


TGGGGACAGT 


GGGATAATCA 


ACTCTCCTTG 


501 


CTCCATCAGG 


GCCCCAGACT 


GGGCTCTGGC 


AGAGGAACTC 


AGAACAACGT 


551 


AAAGACCTAG 


ATAGGTATCT 


AATAAATTGG 


GACCTGTGAA 


AACAGTGCCT 


601 


CTTAAAGTGT 


GGTACCTGGA 


CCAGCAGCAG 


CAGCAGCAGC 


AGCCATTGAA 


651 


ACTTCAT^GA 


AAGACAGATT 


CTCAGCTTCA 


TCCAAGACTT 


ACTGAATTAG 


701 


AATATCTCAA 


GGTAAGGCCT 


GGTAATCTGA 


GCTTTAACTA 


GCCCTCAAGG 


751 


TGATTCTTAA 


GTTCAAGCAT 


CACTATATTA 


AGTTGAACAA 


ATAGATGCCA 


801 


GGCCTATAAA 


TACATGTAAC 


G C C TAG C ATA 


AATATTTCAA 


CATTAAAAAT 


851 


GACATTTCAT 


AGTTCTTATT 


T AC C C T ATT A 


GCTGTGTTCT 


GTCAAGATAA 


901 


TGAGAATATT 


GATATGTTAG 


AAT AC AC TG A 


TGCACTAATT 


TTTAAATTAG 


951 


ATCAAATAAT 


GACTTGTTAT 


ACCTGAAATA 


AATTGGTTCA 


GCTTGGTAGA 


1001 


TGCAGTTTTT 


GAGAATTATA 


TAAGTCATTT 


TTAAAAGAAT 


AATTTTAACT 


1051 


TGAGCTGCTT 


GCATAAATTA 


AATTGCAAAA 


AGGTCATAGT 


ATAAATCCTC 


1101 


CTATTAGCAG 


AGATAGAAGG 


TTTTTAAAAA 


AATTACAGAT 


AAGTCTGAAG 


1151 


G TCTTTT AAA 


ATCTTATATT 


CAGGAAG TGA 


CTCGGGATGT 


ATATCATTTT 


1201 


AAAATACATG 


GTCTTAAATG 


TTGTAGTTGT 


ATGACTCTTT 


CAGTTAATTT 


1251 


■ AAAATAC TTC 


CTTCTATGAA 


AAATTGTTTC 


AAAAATTTTT 


CTAAATTCTG 


1301 


TTATCCATTT 


CAAGTAGGAT 


AGGCAAGAAC 


AGATATAAGA 


TACTACTTTT 


1351 


TTGTTCATGT 


TTACTAAAAA 


AAAAATTACT 


GTAATTGAGA 


TCATGTAAAA 


1401 


ACATGTTTCC 


TGTCTATTTG 


TCTTAACCTT 


TTAATCCTGG 


CACCTTAAAT 


1451 


TTGACATAGT 


AGGAATTAGA 


AGACAATTGC 


AGAAAATGTC 


AACTGGGGAA 


1501 


ATTTTATTCT 


ACTAAAAACT 


ATGTCCATAC 


AACATAGCAA 


ATCACATTTT 


1551 


AAAGGCCAAA 


AAGTCTTTCA 


TAGCAATTTT 


TCAGATTATT 


TTCAAAGCAT 


1601 


ATCTTCTCTC 


TGCTCCTGCA 


GCATGCCGTT 


GATTTTTCTG 


TTATGCAGTC 


1651 


ACATAAGTAA 


TTACATGTTT 


ACATGTCTAT 


TTCACTCATA 


GAACACGAAA 


1701 


CAGTTAAATG 


TAGAATAATA 


TCCAATCCAT 


CTTTTTATCA 


CCAGTAGCTA 
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1751 


G CAT AC TG T A 


GGAACTCAAT 


AAATATATCA 


GATAAATTGT 


GGAAATAACC 


1801 


ATATCAGCTT 


ATAACATATA 


G AAATG TG AG 


TTTAAAAAGA 


AAACAATTAT 


1851 


ACATATGAAA 


AAATTTTTAT 


ACCATTTTTT 


TAAAGACCTT 


TCAGATGTCA 


1901 


TACAGTTTGG 


ACTTTTCCAG 


TGTTTCTTGT 


ATCATGAGAC 


AAT AG TAG AC 


1951 


ATTGTAAATC 


AAAAATAGTT 


TTCTGGGGTT 


GTGTACATTT 


GAAAAAACTG 


2001 


AATATCATAT 


CTGTTCTTAG 


AGAGTAATGA 


TGGATATTAA 


CATATCAAAG 


2051 


GTACAGAGAA 


GTCTTAAAGT 


TCAAAGTAAC 


ATCTGCTTAA 


TTGTATTTAA 


2101 


TTCAGTGCTC 


CATGAGCTTT 


TTTATCACTG 


ATTCCCTCCC 


TTT^TTTCTCT 


2151 


TATGATAATA 


ATTAACTTGT 


TCCTGTAGCA 


TTTTAAGAAA 


TGTTGATTTA 


2201 


GTTGAATGCC 


TTCACTTCTC 


CAATATAATA 


GCAGAAACTC 


AGAAATATTT 


2251 


ATTTACCCAG 


AATCATGCAG 


CTAATAGTAC 


AAGGATTCAG 


GTCTTTTACT 


2301 


TCCTATTTTG 


TGGTTCCCAA 


CTACTTTTGC 


CAAAGGTCTT 


TTAAATAATA 


2351 


TGAAACATAT 


TAG TG ATTG A 


TTCATTATAG 


TAAATGGGTA 


AATGATAAGG 


2401 


CTTGCAATAA 


TTC AC TG AC A 


AGAAAGCTT 
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Murine cDNA sequence 



1 AAG C C AC AGC AC C C TGG AG AC AAC C TTTG AT AC G AC TGTG AC AAC TGAAGTG AATGG AAG 

SHSTLETTFDTTVTTEVNGR 

61 GG C C ATC C C C AAC C TG AC AAGCC G AC CTTC C C C C ATG AC C TGG AG ACTGGG T C AAGCG TG 

AIPNLTSRPSPMTWRLGQAC 

121 CCCTCGTCTACAGGCTGGAGATGCCCCCTCC ATGGGCGCTGGATATTCTCGAAGCGGTAC 

PRLQAGDAPSMGAGYSRSGT 

181 CAGCCGATTCATCCACACGGATCCCTCCAGGTTTATGTATACCACGCCTCTCCGCCGAGC 
SRFIHTDPSRFMYTTPLRRA 

2 41 TGC TG TC TCG CG TC TGGG AAAC ATG TC AC AAAT AG AT ATG AG CG AG AAAGC AAGC AG TG A 

AVSRLGNMSQIDMSEKASSD 

3 01 CC TG G ATGTG TC TTC TG AAG TGG ATGTTGGTGG AT AC ATG AGC G ATGG TG AT ATC C TTGG 

LDVS S EVDVGGYMSDGDILG 

3 61 GAAGAGTCTGAGAGCGGATGATATCAACAGTGGGTACATGACAGATGGTGGGCTCAACCT 
KSLRADDINSGYMTDGGLNL 

421 AT AT ACC AG AAGTCTTAAC CG AG TCC CGGA C AC AGC AAC TTC C AG AG ATGTC AT AC AG AG 
YTRSLNRVPDTATSRDVI QR 

481 AGGC GTTC AC G ATGTG AC AGTGGACGC AG AC AG CTGGG A TGACAG C AG TTCTG TG AGC A G 

GVK DVTVDADSWDDSSSVSS 

541 TGGCCTCAGTGACACACTTGATAACATTAGCAC AGATGACCTCAACACCACGTCCTCCAT 

GLSDTLDNI-STDDLNTTSSI 

601 CAGTTCTTACTCCAACATCACTGTCCCCTCCAGGAAGAACACTCAGCTGAAAACAGATGC 
S SYSNI TVPSRKNTQLKTDA 

6 61 GG AGAAA C GTTCG AC AAC AG ATG AG AC C TGGG ATAG TC C TG AGG AG C TG AAG AAAG C C G A 

EKRSTTDETWDSPEELKKAE 

7 21 GGGAGATTGTGACAGCCATGGTGACGGAGCCGCCAAGTGGAAGGGTGCTACTTCTGGACT 

GDCDSHGDGAAKWKGATSGL 

7 81 TGCTGAAGACTCGGAGAAGACAGGGCAGAAAGCCAGCCTGTCTGTGTCTCAGACAGGCTC 
AEDSEKTGQKASLSVSQTGS 

841 C TGG AGG AG AGG C ATG TC TG CC C AGGG AGG AACTC C AGC T AC AG C T AG GC AG AAAAC C AG 

WRRGMSAQGGTPATARQKTS 

9 01 C ACAAGTGC ACTCAAGACCCCTGGGAAGAC AGATGATGCC AAAGCTTCCG AG AAAGGG AA 

TSALKTPGKTDDAKASEKGK 

9 61 AACTCCTCTCAAAGGATCATCCTTGCAAAGGTCTCCTTCAGATGCAGGGAAAAGCAGCGG^ 
*" T PL KG S S LQR S P S DAG K S S G 

1021 GGATGAAGGGAAAAAGCCACCGTCAGGCATTGGAAGATCGACAGCCAGCAGTTCTTTTGG 
DEGKKPPSGIGRSTASSS-FG 

1081 AT AC AAG AAG C C AAG TGGTG T AGGGG C TTC C AC TATGATTACC AGC AG CGGTGCC ACC AT 
YKKPSGVGASTMITSSGATI 

1141 CACAAGCGGTTC AGCTACACTGGGGAAAATCCCCAAATCCGCTGCCATTGGTGGGAAGTC 
TS, GSATLGKI PKSAAIGGKS 

12 01 CAATGCAGGAAGGAAAACCAGCCTGGACGGGTCCCAGAATCAAGATGATGTTGTCCTGCA 
NAGRKTSLDGSQNQDDVVLH 

12 61 CGTGAGCTCGAAGACCACCCTCCAGTACCGTAGTTTGCCCCGCCCTTCTAAGTCCAGCAC 

VS SKTTLQYRSLPRPSKSST 

13 21 CAGCGGAATCCCTGGGAGAGGTGGCCACAGGTCGAGCACCAGCAGCATTGATTCCAATGT 

SGIPGRGGHRSSTSSIDSNV 
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13 81 CAGCAGCAAGTCAGCTGGGGCCACCACCTCCAAACTG AGAGAACCGACTAAGATCGGCTC 
SSKSAGATTSKLREPTKIGS 

1441 AGGGCGCTCG AGTCCAGTC AC TG TC AACCAAACAGACAAAGAGAAGG AG AAAG TAG CAGT 
GRSS PVTVNQTDKEKEKVAV 

1501 GTCAGATTC AGAG AGCGTTTCCTTGTCAGGTTCCCCCAAATCCAGCCCCACCTCTGCCAG 
SDSESVSLSGSPKSSPTSAS 

1561 TGCCTGTGGG ACTCAAGGGCTCAG AC AGCCAGGGTCC AAATATCC AGATATTGCCTCGCC 
ACGTQGLRQ PGSKY PD I AS P 

1621 CACATTTCGAAGGTTGTTCGGTGCCAAGGCAGGCGGCAAATCTGCCTCCGCACCTAATAC 
TFRRLFGAKAGGKSASA PNT 

1681 TGAGGGGGCGAAGTCCTCCTCAGTAGTGCTCAGCCCTAGTACCTCTTTAGCCCGACAAGG 
EGAKSSSVVLSPSTSLARQG 

1741 CAGTCTGGAGTCACCGTCGTCCGGTACGGGAAGCATGGGCAG TGCTGGTGGGCTGAGTGG 
SLESPSSGTGSMGSAGGLSG 

18 01 CAGCAGCAGCCCTCTCTTCAATAAACCCTCAGACCTAACTACAGATGTTATAAGCTTAAG 
SSSPLFNKPSDLTTDVISLS 

18 61 TCACTCCTTGGCTTCCAGCCCAGCGTCGGTTCACTCTTTCACATCCGGTGGGCTTGTGTG 

HSLASSPASVHSFTSGGLVW 

1921 GGCTGCCAATCTGAGCAGTTCCTCTGCCGGCAGCAAGGACACTCCAAGTTACCAGTCCAT 
AAN LSSSSAGSKDTPSYQSM 

19 81 GACTAGTCTCCATACGAGCTCTGAGTCCATTGACCTGCCCCTCAGCCATCATGGCTCCCT 

TSLHTSSESIDLPLSHHGSL 

2 041 G TC TGG AC TG AC C AC AGG C AC TCACGAGGTGC AG AGCCTGCTC A TG AG AACGGG TAG TGT 
SGLTTGTHEVQSLLMRTGS.V 

2101 G AG ATC T AC TC TC TC AG AAAG AT AC ACC C C ATC ATC TCGG C AGGC C AAC C AAG AAG AAGG 
RSTLiSERYTPSSRQANQEEG 

2161 CAAAGAGTGGCTGCGATCGCATTCCACTGGCGGGCTGCAGGATACTGGCAACCAGTCTCC 
KEWLRSHSTGGLQDTGNQSP 

2221 CTTGGTCTCCCCTTCTGCCATGTC ATCGTCAGCCACCGGAAAATATC ACTTTTCCAACTT 
LVSPSAMSSSATGKYHFSNL 

22 81 GGTGAGTCCCACCAACCTCTCCCAGTTTAACCTGCCTGCACCCAGTATGATGCGCTCCAG 

VSPTNLSQFNLPAP SMMRS S 

23 41 CAGTATCCCCGCCCAGGACTCCTCCTTCGACCTCTATGATGATGCCCAGCTTTGCGGTAG 

S IPAQDSSFDLfYDDAQLCGS 

2401 -TGCAACTTCCCTGGAGGAAAGGCC ACGGGCCGTTAGCCACTCCGGCTC ATTCAGAGACAG" 
A.TSLEERPRAVSHSGSFRDS 

2 4 61 CATGGAGGAAGTTCATGGCTCTTCACTGTCATTGGTCTCCAGCAC ATC ATCCCTTTACTC 
MEEVHGSSLSLVSSTSSLYS 

2 521 TACGGCTGAAGAGAAGGCTCATTCAGAGCAAATCCATAAGCTACGGAGAGAACTGGTTGC 
TAEEKAHSEQIHKLRRELVA 

25 81 CTCCCAGGAGAAAGTCGCTACCCTCACGTCTCAGCTGTCAGCAAATGCTCACCTTGTAGC 
SQEKVATLTSQLSANAHLVA 

2 641 AGCTTTTGAAAAGAGTTTAGGGAATATGACTGGCCGTTTGCAAAGTCTAACCATGACAGC 
AFEKSLGNMT'GRLQSLTMTA 

27 01 GGAAC AAAAGGAATCTGAGCTTATCG AACTGCGGGAAACC ATTGAAATGTTG AAGGCCCA 
E.^Q'KESELI ELRETI EML KAQ 
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27 61 G AAC TCTG C TGC CC AAG C AG C C ATT C AGGG AG C AC TG AATG GC C C AG AC C AC C C T C C C AA 
NSAAQAAIQGALNGPDHPPK 

2 821 AGATCTCCGCATCAGAAGACAGCACTCCTCTGAAAGTGTTTCTAGTATCAACAGCGCAAC 
DLRIRRQHSSESVSSINSAT 

2881 GAGCCATTCCAGCATTGGCAGTGGTAATGATGCTGACTCCAAGAAA 
SHSSIGSGNDADSKK 
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Murine genomic sequence 



1 


GGGATGAAGG 


GAAAAAGCCA 


CCG TCAGGCA 


TTGGAAGATC 


GACAGCCAGC 


51 


AGTTCTTTTG 


GATACAAGAA 


GCCAAGTGGT 


GTAGGGGCTT 


CCACTATGAT 


101 


TACCAGCAGC 


GGTGCCACCA 


TCACAAGCGG 


TTCAGCTACA 


CTGGGGAAAA 


151 


TCCCCAAATC 


CGCTGCCATT 


GGTGGGAAGT 


CCAATGCAGG 


AAGGAAAACC 


201 


AGCCTGGACG 


GGTCCCAGAA 


TCAAGATGAT 


GTTGTCCTGC 


ACGTGAGCTC 


251 


GAAGACCACC 


CTCCAGTACC 


GTAGTTTGCC 


CCGCCCTTCT 


AAGTCCAGCA 


301 


CCAGCGGAAT 


CCCTGGGAGA 


GGTGGCCACA 


GGTCGAGCAC 


CAGCAGCATT 


351 


GATTCCAATG 


TCAGCAGCAA 


GTCAGCTGGG 


GCCACCACCT 


CCAAACTGAG 


401 


AGAACCGACT 


AAGATCGGCT 


CAGGGCGCTG 


GAGTCCAGTC 


ACTGTCAACC 


451 


AAACAGACAA 


AGAGAAGGAG 


AAAGTAGCAG 


TGTCAGATTC 


AGAGAGCGTT 


501 


TCCTTGTCAG 


GTTCCCCCAA 


ATCCAGCCCC 


ACCTCTGCCA 


GTGCCTGTGG 


551 


GACTCAAGGG 


CTCAGACAGC 


CAGGGTCCAA 


ATATCCAGAT 


ATTGCCTCGC 


601 


CCACATTTCG 


AAGGTAAGGG 


TATGTAAAGA 


GATGTTGGGA 


AAACATAAAA 


651 


GGTAGTATAT 


AGCATGTATT 


TATTC TGTAC 


GAAACTATTT 


TCATGTATTC 


701 


TAAATATTCT 


AAGATTCTGT 


ATCTTATACT 


TGTCTAAAAT 


ATAGTGATVTT 


751 


TATTTTGCTG 


ATTGCACCTG 


TTGCTAGTGT 


AAAAGCATTG 


CTCATTTAGA 


801 


GAGTGGTTAG 


CCTTTCAGCT 


ATACAGCCAG 


TGTGACACTA 


AAATACAGAT 


851 


ACCACTTGTA 


GCGGGCATAA 


AACCACATGA 


C TG AC TATTC 


ATAGAAATAA 


901 


AGTGATAGCT 


TGTAAAGATA 


TTTAGTGATT 


TCCACCTCTC 


CTTTCCAGAA 


951 


TTAAAAAAAG 


CAAATTGCAT 


AGATCTTTAT 


AAACACATTT 


AC TTC TAG TG 


1001 


TATG TTATCT 


TGTTGACTCT 


TAATGAAATG 


GCAGTTATGA 


ATATAGATGA 


1051 


TATATTCTTT 


CTAACAGTTT 


ATAAGAGACC 


AATTTATACA 


GTACCAGATC 


1101 


TTAACATAGT 


AACAATAACA 


GCAACAAAAA 


CAACCCAAAA 


AGCTATCAAA 


1151 


GTATGGTCTG 


ATTGCAGAAT 


TTGAAAACAT 


TTACATGTTT 


GACATAGGAC 


1201 


AAGAACTCAG 


GAGTGAGGTG 


AC TTTTT AT A 


AGTCTTCATC 


AATGTCCTTT 


1251' 


TACAGGAACC 


AGGAAGCATA 


TCTGATATAT 


GTGTCAGGAT 


TATCACTTTA 


1301 


TTAATTATGT 


GAAATTCTGT 


TTAGAAATCT 


ACCTGATTTT 


AAAT AC TTT A 


1351 


ATATAGTAGG 


GGTCAAAATT 


AGTTAATGAG 


TTAAGACAAG 


TTGTTAAATA 


1401 


ATCCTGGCTC 


TGTTTTCTCA 


TCTTCAAAAT 


GATAGAGTAT 


AATTTATCAC 


1451 


CTCTTGTTAA 


ATATTTCAGG 


TTTGTGTTTA 


TTCTCTTGAT 


AACTTTGATC 


1501 


TCTTAGAAGA 


GTCTTGAAGA 


ATTTACATTA 


AGTAATCTTA 


GAAACATAAC 


1551 


TATTTGAGAA 


ACAGTAGTCA 


AATTTTGTCA 


TTAGAAGTAT 


TAACTCTGAA 


1601 


GAATGATTTG 


AAGTGACAGT 


TCTTAGAAAG 


AATAAATTAT 


AGCTTGTAGC 


1651 


AAGAGTAAAT 


ATTTTCACTG 


CTTGTGTGAG 


AGCCAAGAGC 


GCCCTCTTGT 


1701 


G G C C C ATT AC 


CTATGAAACA 


ATTTCTCATA 


TTCGCCCTAG 


AAATCTTCCA 
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1751 


CTGCAGGAAA 


TAATGGATTT 


CATTGCCTCT 


GAATTAGTAA 


CCATTCTGCC 


1801 


ATTTCTTCAT 


ACCATTTTAT 


TTCCATACTT 


GCATAAATTT 


GATTATGTCA 


1851 


TCTGCTTCAT 


TTACAAAACT 


AAAATG TTTT 


CTGAGCTAAA 


CTCCAGTAGC 


1901 


TAACTTAGTA 


CAAATGGTAT 


TTTTAAATCA 


CTGCTATAAG 


TATATATATT 


1951 


TGAATAGCTC 


TGGCAACGGA 


CGGAAATCCC 


TATGGTCTTT 


CCATGGGAAG 


2001 


ATACAAACCA 


ATCCATAAGT 


TGTCCAGCAA 


TATCCAATAT 


TTCCAGCCCA 


2051 


GCC AGTCAGG 


CCTCTTAAAC 


ATTACCTTAC 


ATATTTGAAC 


CTTTCCTTAA 


2101 


ATGTCCCCTT 


T AG AC AATC T 


ATTTTTTAAA 


AAGATGAAAA 


TCCATTTAAG 


2151 


CATCATATAT 




oAnu J. J. w i J. X 


PATTATAATG 


GTTCTGCAGA 

\J X X w X WViiWX* 


2201 


TAGGTAATGC 


CAAAACGGCC 


AAAATATTTG 


ATCACTAGAA 


GCGTAAAAGT 


2251 


CAAGTACAAT 


CATGTTGACT 


TTTTTTCCAA 


GGTGGGTTCA 


CTGCTGCCCA 


2301 


CCTTGGTTCC 


AGGCCAGTGC 


TTACTTAAGA 


TATCGTAAGT 


GATTTTTTTT 


2351 


TAATTTTTAA 


TTTTTTAGTA 


GTTGGTTAAT 


CAAAAGCCAG 


TCATGTCACC 


2401 


TTCAGGAACA 


TAGAGGCTGG 


ACGTGCTTGG 


CAGCTCACGA 


CTCCAAAGCA 


2451 


CACTTGGCTC 


TGTGGACTGA 


AACCCTAGGA 


AACGTGGATG 


TGAGTCTCTT 


2501 


GGAACAACTC 


AAGTTGTTAT 


TTGTTTTTCT 


TTTAGGTTGT 


TCGGTGCCAA 


2551 


GGCAGGCGGC 


AAATCTGCCT 


CCGCACCTAA 


TAC 
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Homologous human cDNA 



1 GGATCAGCTTCGGGAGACCATGCACAACATGCAGTTGGAGGTGGACCTGCTGAAAGCAGA 
DQLRETMHNMQLEVDLLKAE 

*1 GAATGACCGACTGAAGGTAGCCCCAGGCCCCTCATCAGGCTCCACTCCAGGGCAGGTCCC 
ND RLKVAPGPSSGSTPGQVP 

1 1 1 TCGATCATCTGCATTATCTTCCCCACGCCGCTCCCTAGGCCTGGCACTCACCCATTCCTT 
LZX GSS ALSSPRRSLGLALTHSF 

1 P 1 CGGCCCCAGTCTTGCAGACACAGACCTGTCACCCATGGATGGCATCAGTACTTGTGGTCC 
lt5X opsLADTDLSPMDGISTCGP 



AAAGGAGGAAGTGACCCTCCGGGTGGTGGTGAGGATGCCCCCGCAGCACATCATCAAAGG 
KEEVTLRVVVRMPPQHI I K G 



1 n 1 GGACTTGAAGCAGCAGGAATTCTTCCTGGGCTGTAGCAAGGTCAGTGGAAAAGTTGACTG 
DL KQQEFFLGCSKVS-GKVDW 

f, 1 GAAGATGCTGGATGAAGCTGTTTTCCAAGTGTTCAAGGACTATATTTCTAAAATGGACCC 
KMLDEAVFQVFKDYISKMDP 

AGCCTCTACCCTGGGACTAAGCACTGAGTCCATCCATGGCTACAGCATCAGCCACGTGAA 
ASTLGLSTESIHGYSISHVK 



481 ACGAGTGTTGGATGCAGAGCCCCCCGAGATGCCTCCTTGCCGTCGAGGTGTCAATAACAT 
RV LDAEPPEMPPCRRGVNNI 

ATCAGTCTCCCTCAAAGGTCTGAAGGAGAAATGCGTCGACAGCCTGGTGTTCGAGACGCT 
SVSLKGLKEKCVDSLVFE TL 

601 GATCCCCAAGCCGATGATGCAGCACTACATAAGCCTCCTGCTGAAGCACCGGCGCCTCGT 
IPKPMMQHYI SLLLKHRRLV 

fi 6 1 cctcTCGGGCCCCAGCGGCACGGGCAAGACCTACCTGACCAATCGCTTGGCCGAGTACCT 
LSGPSGTGKTYLTWRLAEYL 

791 ggtggagcGCTCTGGCCGTGAGGTCACAGAGGGCATCGTCAGCACCTTCAACATGCACCA 
VERSGREVTEGIVSTFNMHQ 

7 81 gcagtcttgcaaggatctgcaactgtatctttccaacctagccaaccagatagaccggga 

QSCKDLQLY^SNLANQIDRE 

841 aacaggaattggggatgtgcccctggtgattctattggatgacctgagtgaagcaggctc 
tgigdvplvill ddlseag s 

901 CATC agtgagttggtcaatggggccctcacctgcaagtatcat7^aatgtccctatattat 
iselvngaltckyhkcpyii 

9 61 aggtaccaccaatcagcctgtaaaaatgacacccaaccatggcttgc acttgagcttcag 

GTTNQPVKMTPNHGLHLSFR 

1021 gatGTTGACCTTCTCC AACAACGTGGAGCCAGCCAATGGCTTCCTGGTTCGTTAGCTGAG 

MLTFSNNVEPANGFLVRYLR 

10 81 GAGGAAGCTGGTAGAGTCAGACAGCGACATCAATGCCAACAAGGAAGAGCTGCTTCGGGT 

RKL VESDSD1NANKEELLRV 

1141 GCTCGACTGGGTACCCAAGCTGTGGTATCATCTCCACACCTTCCTTGAGAAGCACAGCAC 
L DWVPKLWYHLHTFLEKHS^T - 

Fig. 11 
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12 01 CTCAGACTTCCTCATCGGCCCTTGCTTCTTTCTGTCGTGTCCCATTGGC ATTGAGGACTT 

SDFLIGPCFFLSCPIGIEDF 

12 61 CCGGACCTGGTTCATTGACCTGTGGAACAACTCTATCATTCCCTATCTACAGGAAGGAGC 

RTWFIDL-WNNSI IPYLQEGA 

1321 CAAGGATGGGATAAAGGTCCATGGACAGAAAGCTGCTTGGGAGGACCCAGTGGAATGGGT 
KDGIKVHGQKAAWEDPVEWV 

13 81 CCGGGACACACTTCCCTGGCCATCAGCCCAACAAGACCAATCAAAGCTGTACCACCTGCC 

RDTLPWPSAQQDQSKLYHLP 

1441 CCC ACCC ACCGTGGGCCCTCACAGCATTGCCTCACCTCCCGAGGATAGGACAGTCAAAGA 

PPTVGPHSIASPPEDRTVKD 

1501 CAGCACCCCAAGTTCTCTGGACTCAGATCCTCTGATGGCCATGCTGCTGAAACTTCAAGA 
STPS SL.DSDPLMAMLLK.IjQE 

1561 agctgccaactacattgagtctccagatcgagaaaccatcctggaccccaaccttcaggc 
aanyiespdretildpnlqa 

1621 AACACTTTAAGGGTTCGGCAATCACTGTCACCCCCGGACAGCAGAACGCTGGCATCAGCT 
T L * 

16 81 ATCTTAGCTCCTCCTCTCCCCTCTCCTCTTTCAGAGCACTGGCTCTCCAGCCCCAGGAGG 

1741 AGAAC AGGAGGGAGGAGGAGATGAAAGAGGAGGGACAGGTTCTTGGTGCTGTACCTTTGA 

1801 GAACTTCCTAGGAAGGAATGGTGGGGTGGCGTTTGGGAACTTGTGCCCCCTAAACACATT 

1861 TACTGGCCTCCTCTAATGACTTTGGGGAAAAGATGATTCTGGGTCTTTCCCTTGACTTCT 

1921 TGTTTC AATTACAAACTCCTGGGCTTTCTGGGGAGGGGTTCAGAAAACATCAAAACACTG 

1981 CAGCAGTTCCTAAATGATTCTCACAAGCAACCCTGAGAGAGACAGTCTTGTGAGGGAGAT 

2 041 CTGGGGGAGGCAGGAAGCTCCTCAGATTTTCTCACAGACCCTTCCCAATTCC ATCACCAC 

2101 TGCCAACAACTCCTCCCCC AGAGATCTGGCTGGAGCCC AGAAAAAGAAGCATGTGGTTTA 

: 2161 AAAAATGTTTAAATCAATCTGTAAAAGGTAAAAAT 

2221 AAAAAAC AATGG AAAAG ATGAAGCTGG AGAGAGAGG AACC AGTTGCCAAGGTAGAG AGCT 

22 81 GCCCGCTCCTGCCCTCTGGATG ACATAGGGGACATCAACAAGACGGCTGCCAACCTGAGA 

23 41 AGTCACCAAACCACAAAAATAACCTTACAGCCTTCAGGGAAAGACTACCAGCTCTGTCTT 

24 01 TCTACCCTCTAATTTAACAATGCATAAGAGTCAATAAACCCTACTTTTTTAAAAAAAAAA 



Fig. 11 (cont'd) 
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Homologous murine cDNA sequence 

1 G AACTATGGGAAAAAGAGATGAAGCTCACGGATATCCGGTTGGAGGCCCTCAACTCTGCC 
ELWEKEMKLTDIRLEALNSA 

61 CACCAGCTGGACCAGCTTCGGGAGACCATGCACAATATGCAGTTGGAGGTGGACCTGCTG 
HQLDQLRETMHNMQLEVDLL 

121 AAAGC AGAGAATG ACCGGCTGAAGGTTGCCCCCGGCCCCTCCTC AGGCTGC ACTCC AGGG 

KAENDRLKVAPGPSSGCT PG 

181 CAGGTCCCTGGGTCATCGGCTCTGTCGTCCCCTCGACGTTCCCTGGGCCTTGCACTCAGC 
QVPG S SAL S S PRRSLG LAL S 

2 41 C ATCCTTTCAGTCCTAGTCTCACAGACACAGACCTCTCACCCATGG ATGGCATCAGCACC 

HPFSP-SLTDTDLSPMDGIST 

3 01 TGTGG TTC AAAGG AAG AGG TG AC CCTG CGGG TGGTGG TC CGG ATGC CGCCC C AGC AC AT C 

CGSKEEVTLRVVVRMPPQHI 

3 61 ATCAAAGGGGACTTAAAGCAGCAGGAGTTCTTCCTGGGTTGCAGCAAGGTCAGTGGCAAA 
IKGDLKQQEFFLGCSKVSGK 

421 GTTGACTGGAAGATGCTGGATGAAGCCGTTTTCCAAGTGTTCAAGGACTAC ATTTCTAAA 

VDWKMLDEAVFQVFKDYIS K 

481 ATGG ACC C AG C C TC AAC C C TGGG AC TG AG C AC TG AGTCC ATAC ATGGC TAT AGC C TC AGC 

MDP.ASTLGLSTESIHGYSLS 

541 C ACG TG AAACG AG TGC TGG ATGC TG AGC C C C C AG AGATGC C TC CTTGCCGCC G AGG TG TC 

HVKRVLDAEPPEMPPCRRGV 

601 AATAACATATCAGTCGCTCTCAAAGGTCTGAAAGAGAAGTGTGTCGACAGCCTGGTGTTC 
NNISVALKGLKEKCVDSLVF 

661 G AG ACGC TT ATC C C C AAG C CC ATG ATG C AG C AC TAC ATC AGCC TC C TG C TC AAG C AC CGG 

ETLIPKPMMQHYISLLLKHR 

721 CGCCTGGTGCTCTCCGGCCCCAGTGGCACCGGCAAGACCTACTTGACCAATCGGCTAGCC 
RLVL SGPSGTGKTYL TNRLA 

7 81 GAGTACCTGGTGGAGCGCTCCGGCCGCGAGGTCACGGATGGCATCGTCAGC ACTTTC AAC 

EYLVERSGREVTDGIVSTFN 

841 ATGCACCAGCAGTCTTGC AAGGATCTGC AACTGTACCTCTCCAACCTAGCCAACC AGATA 

MHQQSCKDLQLYLSNLANQI 

9 01 GACCGGGAAACAGGGATAGGGGATGTGCCCTTGGTGATCCTCCTGGATGATCTGAGTGAA 
DRETGIGDVPLVILLDDLSE 

9 61 GCAGGCTCCATCAGTGAGCTGGTCAATGGGGCCCTCACCTGCAAGTATCAC AAATGTCCC_ 

AGSI SELVNGALTCKYHKCP 

1021 TACATTATAGGTACCACCAATCAGCCTGTAAAAATGACACCCAACCATGGCTTGCACTTG 
YI IGTTNQPVKMTPNHGLHL 

10 81 AGC TTC AGG ATGCTGACCTTCTCG AAC AATGTGGAACC AGCC AATGGCTTTCTGGTCCGT 

SFRMLTFSNNVEPANGFLVR 

1141 TACCTGCGGAGGAAGTTGGTAGAGTC AGACAGTGACGTCAATGCTAAC AAGGAAGAGCTG 
YLRRKLVESDSDVNANKEEL 

12 01 CTTCGGGTGCTGGACTGGGTGCCCAAGCTGTGGTATCACCTCCACACCTTCCTGGAGAAG 
LRVLDWVPKLWYHLHTFLEK 

12 61 CACAGCACCTCGGACTTCCTCATTGGCCCTTGCTTCTTCCTGTCCTGTCCCATTGGCATC 

HSTSDFLIGPCF FLSCPIGI 

13 21 GAGGACTTCCGGACCTGGTTCATTGACCTGTGGAACAATTCCATCATCCCCTATCTACAG 

EDFRTWFIDL.WNNSI IPYLQ 
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13 81 GAAGGAGCCAAGGATGGGATCAAGGTTC ATGGACAGAAAGCTGCTTGGGAAGACCCGGTG 
EGAKDG I KVHGQKAAWEDPV 

1441 GAATGGGTCCGAGACACTCTTCCCTGGCCGTCGGCCC AACAAG ACC AATC AAAGCTCTAC 
EWVRDTLPWPSAQQDQSKLY 

1501 CACCTGCCCCCGCCTTCTGTGGGCCCCCACAGCACTGCCTCACCCCCGGAGGACAGGACA 
HLPPPSVGPHSTASPPEDRT 

1561- GTCAAAGACAGC ACTCC AAACTCCCTCGACTCAGATCCCCTGATGGCCATGCTACTGAAA 
VKDSTPNSLDSDPLMAMLLK 

1621 CTCCAAGAAGCTGCCAACTACATTGAGTCACCAGATCGAGAGACTATCCTGGACCCCAAC 
LQEAANY I ESPDRETILDPN 

1681 CTQC AGGCGAC ACTCTGAGGGCCCGGC AGTC ACTGTCACCCTGGAGGGC AGAAGGCTGGC 
L Q A T L * 

1741 TTC AGC ATC ATTAGCTCTCCTCTGCCCTCTTCCTTCATAGCTCTGGCTCACCAGCCTCGC 

1801 CAAGAGAACAGGAGGGAAGAAGAGGGCAGGAGGAGGGATGGGTTCTCGGTGCTGAACCTT 

1861 TG AG AAC TTC C TAC TAGG AATTGG AGGGGG TGGAGTTTG AG AAC TC C GTGC C C C TT AACT 

1921 ACATTTGCTGGCCTCCTCTTACGACTTAGGAGAAAAGATGATTCTGGTCTTTTCTTC AAG 

19 81 TTTTG TTTC AC CT AC AAAC TC TTGGGC TTTC TGGGG AGGG ATTC GG AAG ATAT AAAC AG A 

2 041 C AAAC AAAAAC AAAC AAAC C AACTAC AG C AG TTC C AAGCTCGTTC TC AC AAAC AC C TCTG 

2101 AGACAGTCAC ATGTGGGCAAATCTAAGGGAGGCAGGAAGCTCTACAGACTTTCTTGCAAA 

2161 CCCTTCCCAGTTCTGTCGACACTGCCAACAACCTCCCCGCCAGAGACCTGGCCAGAGCCA 

2221 AGAAAAGAGAAGC ATGTGGTTTAAC AGAAAAAC AAAAC AAAACAAAAC AAAAAATATATG 

2281 TG TAAA TC AA C CTG TAG AAGGT AAAAACGGC AATGG AAAAG ATG AAGC TGG AAGG AGG GG 

23 41 CCCAGTTGCCAAGATGGAACGAGAGCTGCCAGATCTTGCCTTCTGGATGACAAGAGGGGA 

2401 CATTGCAAGATGGCTGCCAGTCTAAAACGTCACCAGACCACAAGAGTAACATCACAGCCT 

2461 TCG AAG AAAGGCC AC AAGCTGTCTTTCTGCCCTCTAACTG AAC ATG C ATG AAAAG TCAAT 

2 521 AAACCCTACTTTTTAATTTTTAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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T2 Murine cDNA with following intron 



CCAATAGAACTCCGGATCAAGAGGCAGAATTCCTCAGATAGCATCTCCAGCCTCAACAGC 

1 + + + + + + 60 

a PIELRIKRQNSSDSISSLKS 

ATC AC C AGC C ATT C C AG C AT CGGCAGC AG C AAAGATG CTGATG CCAAGAAGAAAAAG AAG 
61 + + + + + + 120 

a ITSHSSIGSSKDADAKKKKK 

AAGAGTTGGGTATGTAAAGGCTTGGGGATCGGCCTGTGCTAGGAGTCACTCACCCTGTTG 

121 + + + + + + 180 

a K S W 

CAGGGAACTGACCCCTTTCAGGATCAACAAAGAGGGTCCCTTCTAACAGGATGCCAGTGT 
181 + + + + + + 240 

TGTGACATCTGCTGGGGACAAAAATTCACTAAGTTCCCATTCCTCTATCCATTGTCTATT 
241 + + + + + + 300 

CTCCTTACCACCGCCCTGCACATATACCCCAGCCCCCCACCGTCCCTGCATCCTTTATAC 
301 + + + + + + 360 

ATGTCTGCTATCCTGGGGCTCTACCTACTGATGAGGTCAAATGTATTTGGCCGTAGAAGG 
361 + + + + + + 420 

AGCTGAGAAAATTATTCATGGGTGGGAGAGTGGGGCATGTGGAGAGAATTTGTAAGCCAA 
421 + + + + + + 480 - 

GCAGGGTACTCTAGACGCTCCTGGGGCTGTTGCTTTAGTTTGGGTGAGGAGGCTGTGGAA 
481 + + + + + - + 540 

CGTCCCCATCGCTCCAAAGCCTGCTTTTGTCTGGTCCAGAGGTGGGTTTGTTCTGTGTGG 
541 + + + + + + 600 

TATCCCCCCTGTAACTCTAAACTGGCTTTGGGTGAGCTTTCTACAATCTGTACGCAGGTG 
601 + + + + + + 660 

TAGGGCACTGCCTGACTGACTGAAAGGGAGAGTGACCCAGAGTGAGCGTCTTGTCCCTGT 
661 + + + + + + 720 

CCCTGCTGAGGAGGGCTGGCTACAGACTTTGGCCTAGTGCAGACAGGAGCCAGCTGTGTG 
721 + -- + + + + + 780 

GAGAAGCAGCTGTGTGAAATGCATGAGTAGTGTCGCTGCTGCTGCTGCTGCTGCTTTCTT 
78! + + + + + + 840 

TTCATTGTTTTTTTTTTTTTTTCTTTCCTTTTATTTCCTTCAAAATGCTGACCTCAAATC 

CCTATTTTTTTTCCAGGTTTATGAGGTAAGAAACTCGGTTCCTCTCCTCGTGGTTTTTCT 
901 + + + + + + 960 

TTCCCTTTGCACACCTTCGTGTTTCCAGAGCAAGCACCTCTCTTAAAAAAAAAAAAAAAA 
961 + + + + + + 1020 

AAAAA 2 
1021 1025 - _ * 

Fig. 13 
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splicing variant x (JFC410) 

1 AGCGAGTTACTCACGCTTCCCCTCCATCGGAAGCCAGCCAGGCCAAAACCCAGCAAGATA 
RVTHASPPSEA5QAKTQQDM 

6 1 TGCAGTCCAGTCTGGCAGCCAGATATGCAACTCA.GTCTAATCACAGTGGAATTGCAACCA 
Q S SLAARYATOSNHSGIATS 

121 GTCAAAAAAAGCCTACTAGGCTTCCAGGGCCCTCTAGGGTGCCTGCTGCAGGAAGCAGCA 
O K K P T RLPGPSRVPAAGSSS 

191 GCAACCTCCACCCAC CCTCTAATTTAAATAQGAGaAGTCAQAQCTTT^CAGCATTGACA 

KVQGASNLNRRSQSFNSIDK 

241 AA 

bp 1 corresponds to bp 914 of : THC 

underlined sequence represents further splicing form and is not 
shown in the THC sequence 



Fig. 14 
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splicing variant 2 



1 GGCACTCACGAGGTCCAGAGCCTGCTCATGAGAACGGGTAGTGTGAGATCTACTCTCTCA 
GTHEVQSLLMRTGSVRSTLS 

6 1 GAAAGATATACCCCATCATCTCGGCAGGCCAACCAAGAAGAGGGCAAAGAGTGGTTGCGT 
ERYTPSSRQANQEEGKEWLR 

12 1 TCTCATTCTACTGGAGGGCTTCAGGACACTGGCAACCAG 
S-TISTGGLQDTGNQ 



bp 1 corresponds to bp 3300 of THC 

underlined base pairs -> position of the differentially spliced exon which 
lacks here but is shown in the THC sequence 
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T2-CDNA sequence and T2 protein encoded therein 

CCGCGGGGCTTCCATCCTTCCTTTGACTGATTTTTAAATTTTAATTTGTATTTTCCCCGC 

1 + + + + + + 60 

RGAS ILPL TDF* ILICIFPA- 

CGCCCCGCCCCTTTTCCTCCGACCCCGCCCTATCGCTCCCCGGCTTCCCTGCTCTTTCCT 

61 -4- + + + + + 120 

APPLFLRPRPIAPRLPCSFL- 

TTTTCCCGGCTTCCTTCCTCGCGTTTCTTTCCCCTGCGCCCTCGGCTTGCCTCTCTCCCT 

121 . + + + + - 4- 180 

FPGFLPRVSFPCALGLPLSL- 

CCTCCCTCGCTCTCTCCCCCTTCTCTCCCCTTCTTCCTCGGTTTCTTCCGTCCTCTCTCT 

181 + + + + + + 240 

LPRSLPLLSPSSSVSSVLSL- 

CCCCCTCCTCCTCCCCCGCCTCCTCCTCCTGCGCTCCCGCCCCCTGCCCCCTCCCCCCGT 

241 + + + + 4- 4- 300 

PLLLPRLLLLRSRPLPPPPV- 

GCCTGCAGAGGCGCGGATCGTCCATGCGCTCCTCGCGGGCAGAATGCTGGGCAGCAGCGT 

301 + + + + + + 360 

PADARIVHALLAGRMLGSSV- 

CAAGAGCGTGCAGCCCGAGGTGGAGCTGAGCAGCGGCGGCGGCGACGAGGGCGCGGACGA 

361 + + + + + + 420 

KSVQPEVELSSGGGDEGADE 

ACCGCGGGGCGCCGGCAGGAAGGCGGCAGCGGCGGACGGCAGAGGCATGCTGCCCAAGCG 

421 + + + + 4- + 480 

PRGAGRKAAAADGRGML.PKR 

CGCCAAGGCGCCCGGCGGCGGCGGCGGCATGGCCAAGGCCAGCGCGGCTGAGCTGAAGGT 

481 + 4- + 4- + 4- 540 

AKAPGGGGGMAKASAAELKV- 

CTTCAAGTCCGGCAGCGTGGACAGCCGTGTCCCCGGCGGGCCGCCCGCCTCCAACCTGCG 

541 + + + + + + 600 

FKSGSVDSRVPGGPP ASNLR- 

CAAGCAGAAGTCACTCACCAACCTCTCTTTTCTCACGGACTCCGAGAAAAAGCTGCAGCT 

601 4- + 4- + 4- 4- 660 

KQKSLTNLSFLTDSEKKLQL- 

TTATGAGCCCGAATGGAGCGACGATATGGCCAAGGCGCCCAAAGGCTTAGGCAAGGTGGG 

661 + + + + + + 720 

Y E P E W S D D M A K A P K G L G K V G 

GTCCAAGGGCCGTGAAGCTCCGCTGATGTCCAAGACGCTGTCCAAGTCGGAGCACTCGCT 

721 + 4 + + + + 780 

SKGREAPLMSKTLSKSEHSL- 

CTTCCAGGCCAAGGGCAGCCCGGCGGGCGGCGCCAAGACCCCCCTGGCTCCGCTCGCGCC 

781 + + + + + 840 

FQAKGS PAGGAKTPLAPLAP 



Fig. 16 
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CAACCTGGGAAAGCCGAGCCGGATCCCTCGAGGACCCTATGCGGAGGTCAAGCCGCTCAG 

841 + + + + + + 900 

NLGKPSRI PRGPYAEVKP ETS 

CAAGGCGCCTGAAGCGGCCGTGAGCGAAGATGGCAAATCGGACGACGAGCTGCTCTCCAG 

901 + + + + + + 960 

KAPEAAVSEDGKSDDELLSS 

CAAGGCCAAGGCGCAAAAGAGCTCTGGGCCTGTCCCCTCTGCCAAGGGCCAGGAGGAGCG 

961 + + + + + + 1020 

KAKAQKSSGPVPSAKGQEER- 

CGCCTTCCTCAAGGTGGACCCCGAGCTGGTGGTGACCGTGCTGGGAGACCTGGAGCAGCT 

1021 + + + + + ; + 1080 

AFLKVDPELVVTVLGDLEQL- 

GCTCTTCAGCCAGATGCTGGACCCAGAGTCCCAGAGAAAGAGGACAGTGCAGAATGTCCT 

1081 + + + + + + 1140 

LFSQMLDPESQRKRTVQNVL - 

GGATCTCCGGCAGAACCTGGAAGAGACCATGTCCAGCCTGCGAGGGTCCCAGGTGACTCA 

1141 + + + + + + 1200 

DLRQNLEETMSSLRGSQVTH- 

CAGCTCCCTGGAGATGACCTGCTACGACAGCGATGATGCCAACCCACGCAGCGTGTCCAG 

1201 + + + + + + 1260 

SSLEMTCYDSDDANPRSVSS 

CCTCTCCAACCGCTCGTACCCTCTGTCATGGCGCTATGGCCAGTCCAGTCCGCGGCTGCA 

1261 + + + + + 1320 

LSNRSYPLSWRYGQSSPRLQ- 

GGCTGGTGAGGCGCCCTCTGTGGGTGGGAGCTGCCGCTCGGAGGGGACGCCSGCCTGGTA 

1321 + + + + + + 1380 

AGDAP SVGGS CRSEGT P AWY 

CATGCACGGCGAACGGGCCCACTACTCCCACACCATGCCCATGCGCAGCCCCAGCAAGCT 

1381 + + + + + + 1440 

MHGERAHYSHTMPMRS PSKL 

CAGCCATATCTCCCGCCTGGAGCTGGTCGAATCCCTGGACTCGGATGAGGTGGACCTCAA 

1441 + + - + + + + 1500 

SHISRLELVESLDSDEVDLK- 

GTCCGGCTACATGAGCGACAGTGACCTCATGGGCAAGACCATGACGGAGGATGATGACAT 

1501 + + + + + + 1560 

SGYMSDSDLMGKTMTEDDDI 

JCACTACCGGCTGGGATGAAAGCAGCTCCATCAGTAGTGGACTCAGCGATGCCTCAGACAA 

1561 + + + + + + 1620 

TTGWDESSSISSGLSDASDM- 

TCTCAGTTCAGAAGAATTCAATGCCAGCTCCTCACTCAACTCCCTCCCAAGTACTCCCAC 

1621 + + + + + + 1680 

LSSEEFNASS SLNSLPSTPT 

TGCTTCTCGCAGGAACTCAACAATAGTGCTACGCACAGACTCAGAGAAGCGCTCACTGGC 

1681 + + + + + I 740 

ASRRNSTIVLRTDSEKRSL^A- 
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AGAAAGTGGGCTGAGCTGGTTTAGTGAATCAGAGGAGAAAGCCCCTAAAAAACTGGAGTA 

1741 + + + + + + 1800 

ESGL SWFSESEEKAPKKLfiY- 

CGACAGTGGTAGCCTGAAGATGGAACCTGGGACTTCTAAGTGGCGGAGGGAGCGGCCTGA 

1801 + + + + + + 1860 

DSGSLKMEPGTSKWRRERPE- 



GAGCTGTGATGATTCATCCAAGGGTGGAGAACTGAAAAAGCCCATCAGCCTGGGCCACCC 

1861 + + + + + + 

SCDDSSKGGELKKPISLGHP 



TGGTTCCCTGAAGAAGGGCAAGACCCCACCTGTGGCTGTAACTTCCCCCATCACTCACAC 

1921 



+ + + + ---- + 1980 



G SLKKGKTP PVAVT S P I THT 

AGCCCAGAGTGCCCTCAAAGTCGCAGGCAAACCTGAGGGCAAAGCTACAGACAAGGGTAA 

. . + + + 2040 

1981 + + + 

AQSALKVAGKPEGKATDKGK- 

GCTTGCAGTGAAGAATACTGGGCTCCAACGCTCCTCCTCTGATGCTGGTCGGGACCGCCT 

. + + + + 2100 

2041 + + + _ „ 

LAVKNTGLQRSSSDAGRDRL- 

GAGTGATGCTAAGAAGCCCCCCTCGGGCATTGCTCGCCCCTCCACTTCGGGATCCTTTGG 

. + + + 2160 

2101 + + + 

SDAKKPPSGIARPSTSGSFG- 

CTACAAGAAGCCTCCTCCTGCCACAGGCACAGCCACTGTCATGCAAACTGGTGGTTCAGC 

2161 + + + + + + 2220 

YKKPPPATGTATVMQTGGSA- 

CACTCTCAGCAAGATCCAGAAGTCCTCAGGCATCCCTGTCAAGCCAGTAAATGGGCGCAA 

! , -\ 1 1- 22 8 0 

2221 t~W + k iqkssgipvkpvngrk- 

GACTAGCTTAGATGTTTCCAACAGTGCAGAGCCAGGATTCCTGGCTCCTGGAGCCCGTTC 

, + + + + 2340 

2281 f- 1 ' ^ 

TSLDVSNSAEPGFLAPGARS - 

TAACATCCAGTACCGCAGCCTGCCCCGGCCAGCCAAGTCAAGTTCTATGAGCGTGACCGG 

+ + _ + + + + 2400 

2341 ~n"i~~g~ + Y RSLPRPAKSSSMSVTG- 

CGGGCGGGGTGGACCTCGCCCTGTGAGCAGCAGCATTGACCCCAGTCTCCTCAGCACCAA 

. + + + 2460 

2401 H **" 

GRGGPRPVSSSIDPSLLSTK- 

.GCAGGGAGGCCTTACGCCTTCCAGACTGAAGGAGCCTACCAAGGTAGCCAGTGGGCGGAC 

+ + + + 2520 

2461 ~q"g"g~ + l~W~s rlkeptkvasgrt- 



CACTCCAGCCCCTGTCAATCAGACAGATCGGGAAAAGGAGAAGGCCAAAGCCAAGGCAGT 

+ + + + + + 

TPAPVNQTDREKEKAKAKAV 

GGCCTTGGACTCAGACAACATCTCCTTGAAGAGTATTGGCTCCCCAGAAAGTACTCCCAA 

+ + + + + + 

ALDSDNISLKSIGSPESTP/-K 

Fig. 16 (cont 1 d 2) - 
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GAACCAAGCAAGCCACCCCACAGCCACCAAGCTGGCAGAGCTGCCACCAACCCCTCTCAG 

+ + + + + + 

N QASHPTATKLAELPPTPL.R 



GGCCACAGCGAAGAGCTTTGTCAAACCACCCTCACTAGCCAATCTTGACAAGGTCAACTC 

2701 + + + + + + 2760 

ATAKS FVKPPSLiANLDKVNS - 

CAACAGTCTGGATCTACCATCATCCAGTGATACCACCCATGCTTCAAAGGTCCCAGATCT 

2761 + + + + + + 2820 

NSLDL.PSSSDTTH ASKVPDL- 



GCATGCTACAAGCTCAGCATCTGGGGGCCCTCTCCCTTCCTGCTTCACCCCCAGTCCGGC 

H + h + h b 

H ATSSASGGPLPSC. FTPSP^ 

ACCCATCCTCAATATTAACTCAGCCAGCTTCTCCCAGGGCCTGGAGCTAATGAGTGGTTT 

+ + + + + + 

PILNINSASFSQGLELMSGF 

CAGTGTGCCAAAAGAGACCCGCATGTACCCCAAACTCTCAGGCCTGCACAGGAGCATGGA 

+_-- + + + + + 

SVPKETRMYPKLSGLHRSME 



GTCCCTCCAGATGCCAATGAGCCTCCCCAGTGCCTTCCCCAGCAGTACTCCCGTCCCCAC 

3001 + + + + + + 3060 

SLQMPMSLPSAFPSSTPVPT* 

CCCACCTGCTCCCCCTGCTGCTCCCACAGAAGAAGAGACGGAAGAGCTGACTTGGAGTGG 

3061 + + + + + + 3120 

PPAPPAAPTEEETEELTWSG- 



AAGCCCCAGAGCTGGGCAACTGGACAGTAATCAGCGGGATCGGAACACTCTTCCCAAGAA 

3121 + + + + + + 

SPRAGQLDSNQRDRNTLPKK 



AGGGCTC AGGTACC AGCTTC AGTC C CAGGAGGAGAC CAAGGAGAGGCG ACATTC CC ATAC 

+ + + + + + 

GLRYQLQ S QEETKERRHSHT 



CATTGGTGGGCTGCCTGAATCCGATGACCAGTCAGAGCTGCCTTCTCCCCCTGCACTTCC 

3241 + + + + + + 

IGGLPESDDQSELPSPPALP 



CATGTCTCTGAGTGCAAAGGGCCAACTTACCAACATAGTGAGTCCCACTGCGGCCACCAC 

3301 + + — + + + 3360 

MSLSAKGQLTNIVSPTAATT- 



-GCCAAGAATCACCCGCTCCAACAGCATCCCCACCCACGAGGCGGCCTTCGAGCTGTACAG 

+ + + + + + 

PRITRSNS IPTHEAAFELYS 



CGGCTCCCAAATGGGGAGCACCCTGTCCCTGGCCGAGAGACCCAAGGGAATGATTCGGTC 

, + + + 3480 

3421 + + + + 

GSQMGSTLSL.AERPKGMIRS - 

AGGATCCTTCCGAGACCCCACGGACGATGTTCACGGCTCAGTGCTGTCCCTGGCCTCCAG 

. + + ^ + 3540 

3481 + " + ' + + 

GSFRDPTDDVHGSVLSLAS'-S - 

Fig. 16 (cont'd 3) " ■ " _r- 
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TGCCTCCTCCACCTACTCCTCAGCTGAGGAGAGGATGCAATCTGAGCAAATCCGGAAGCT 

3541 + + ; + + + + 3600 

ASSTYSSAEERMQSEQIRR^L- 

TCGTAGGGAACTGGAATCATCCCAGGAAAAAGTGGCCACCTTGACGTCTCAGCTTTCTGC 

3601 + + + + + + 3660 

RRELESSQEKVATLTSQLSA- 

CAATGCTAATCTGGTGGCTGCTTTTGAGCAGAGCCTGGTGAATATGACATCCCGCCTGCG 

3661 + + + + + + 3720 

NANLVAAFEQSLVNMTSRLR- 

ACACCTGGCAGAGACGGCCGAGGAGAAGGACACTGAGCTGCTGGATTTGCGAGAAACCAT 

3721 + + + + + + 3780 

H_LAETAEEKDTELLDLRETI- 

AGACTTTCTGAAGAAAAAGAACTCTGAGGCCCAGGCAGTCATTCAGGGAGCCCTTAATGC 

3781 + + + + + + 3840 

DFLKKKNS EAQAVIQGALNA- 

CTCAGAAACCACACC CAAAGAACTTCGGAT C AAGAG ACAAAACTC CT CAGATAGCAT CTC 

3841 + + + + + + 3900 

SETTPKELRIKRQNSSDSIS- 

AAGCCTCAACAGCATCACTAGCCATTCCAGCATCGGCAGCAGCAAGGATGCTGATGCGAA 

3901 + + + + + + 3960 

SLWSITSHSSIGSSKDADAK- 

AAAGAAGAAAAAAAAGAGTTGGCTTCGAAGTTCCTTCAACAAAGCGTTCAGTATAAAAAA 

3961 + + + + + + 4020 

KK.KKKSWLRS S FNKAP S I K K - 

GGGGCCCAAGTCAGCTTCCTCATACTCGGATATAGAGGAGATTGCTACACCCGACTCTTC 

4021 + + + + + + 4080 

GPKSASSYSDIEEIATPDSS 

AGCCCCCTCATCCCCCAAACTACAGCATGGTTCTACAGAGACTGCTTCACCCTCCATCAA 

4081 + + + + + + 4140 

APS S PKLQHGS T ETAS PS I K - 

GTCCTCCACCTCGTCCTCCGTGGGCACTGATGTCACCGAGGGCCCTGCTCACCCAGCCCC 

4141 + + + + + + 4200 

S STS S SVGTDVTEGPAHPAP 

C CAC ACTAGGCTGTTCC ATGCAAATGAGGAGGAGGAGC CAGAGAAGAAGGAGGTATCGGA 

4201 + + + + + + 4260 

HTRLFHANEEEEPEKKEVSE- 

^GCTGCGCTCTGAGCTATGGGAGAAGGAAATGAAGCTTACAGACATCCGCTTGGAGGCCCT 

4261 + "- + + + + + 43 20 

LRSELWEKEMKLTD I RLEAL 

CAACTCTGCCCACCAACTGGATCAGCTTCGGGAGACCATGCACAACATGCAGTTGGAGGT 

4321 + + + + + + 4380 

NSAHQLDQLRETMHMMQIiEV 

GGACCTGCTGGAAGCAGAGAATGACCGACTGAAGGTAGCCCCAGGCCCCTCATCAGGCTC 

4381 + + + + + + 4440 

DLLEAENDRLKVAPG P S S -3 - 

Fig- IB (cont'd 4) 
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CACTCCAGGGCAGGTCCCTGGATCATCTGCATTATCTTCCCCACGCCGCTCCCTAGGCCT 

4441 + + + + + + 4500 

TPGQVPGSSALSSPRRSLGL- 

GGCACTCACCCATTCCTTCGGCCCCAGTCTTGCAGACACAGACCTGTCACCCATGGATGG 

4501 + + + + + + 4560 

ALTHSFGPSLADTDLSPMDG- 

CATCAGTACTTGTGGTCCAAAGGAGGAAGTGACCCTCCGGGTGGTGGTGAGGATGCCCCC 

4561 + + + + + + 4620 

I STCGPKEEVTLRVVVRMPP 

GCAGCACATCATCAAAGGGGACTTGAAGCAGCAGGAATTCTTCCTGGGCTGTAGCAAGGT 

4621 + + + + + " + 4680 

Q _ H I IKGDLKQQEFF,LGCSKV- 

CAGTGGAAAAGTTGACTGGAAGATGCTGGATGAAGCTGTTTTCCAAGTGTTCAAGGACTA 

4681 + + + + + + 4740 

SGKVDWKMLDEAVFQVFKDY 

TATTTCTAAAATGGACCCAGCCTCTACCCTGGGACTAAGCACTGAGTCCATCCATGGCTA 

4741 + + + + + + 4800 

ISKMDPASTLGLSTES I H G Y - 

CAGCATCAGCCACGTGAAACGAGTGTTGGATGCAGAGCCCCCCGAGATGCCTCCTTGCCG 

4801 + + + + + + 4860 

S ISHVKRVLDAEPPEMPPCR 

TCGAGGTGTCAATAACATATCAGTCTCCCTCAAAGGTCTGAAGGAGAAATGCGTCGACAG 

4861 + + + + + + 4920 

RGVNNI SVSLrKGLKEKCVDS 

CCTGGTGTTCGAGACGCTGATCCCCAAGCCGATGATGCAGCACTACATAAGCCTCCTGCT 

4921 + + + + + - + 4980 

LVFETLIPKPMMQHYISLLL- 

GAAGCACCGGCGCCTCGTCCTCTCGGGCCCCAGCGGCACGGGCAAGACCTACCTGACCAA 

4981 + + + + + + 5040 

KHRRLVLSGPSGTGKTYLTN- 

TCGCTTGGCCGAGTACCTGGTGGAGCGCTCTGGCCGTGAGGTCACAGAGGGCATCGTCAG 

5041 + + + + + + 5100 

RLAEYLV ERSGREVTEGIVS 

CACCTTCAACATGCACCAGCAGTCTTGCAAGGATCTGCAACTGTATCTTTCCAACCTAGC 

5101 + + + ----- + + + 5160 

TFNMHQQSCKDLQLYLSNLA- 



* CAACCAGAT AGAC CGGGAAACAGGAATTGGGG ATGTGCCC CTGGTGATTCT ATTGGATG A 

+ + + + H + 

NQIDRETGIGDVPLVILLDD 
CCTGAGTGAAGCAGGCTCCATCAGTGAGTTGGTCAATGGGGCCCTCACCTGCAAGTATCA 

+ + f. + H h 

LSEAGS ISELVNGALTCKYH 



TAAATGTCCCTATATTATAGGTACCACCAATCAGCCTGTAAAAATGACACCCAACCATGG 

5281 + + + + + 5340 

KCPYI IGTTNQPVKMTPNHL-G 



Fig. 16 (cont'd 5) 
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CTTTCACTTGAGCTTCAGGATGTTGACCTTCTCCAACAACGTGGAGCCAGCCAATGGCTT 

5341 + + + + + + 5400 

FHLS FRMLTFSNNVE PANG F 

CCTGGTTCGTTACCTGAGGAGGAAGCTGGTAGAGTCAGACAGCGACATCAATGCCAACAA 

5401 + + + + + + 5460 

LVRYLRRKLVESDSDINANK- 

GGAAGAGCTGCTTCGGGTGCTCGACTGGGTACCCAAGCTGTGGTATCATCTCCACACCTT 

5461 + + + + + + 5520 

EELiLRVLDWVPKLWYHLHTF 

CCTTGAGAAGCACAGCACCTCAGACTTCCTCATCGGCCCTTGCTTCTTTCTGTCGTGTCC 

5521 + + + + + + 5580 

L EKHSTSDFLIGPC FFLSCP- 

CATTGGCATTGAGGACTTCCGGACCTGGTTCATTGACCTGTGGAACAACTCTATCATTCC 

5581 + + + + + + 5640 

IGIEDFRTWFIDLWNNSIIP- 

CTATCTACAGGAAGGAGCCAAGGATGGGATAAAGGTCCATGGACAGAAAGCTGCTTGGGA 

5641 + + + + + + 5700 

YLQEGAKDGI KVHGQKAAWE 

GGACCCAGTGGAATGGGTCCGGGACACACTTCCCTGGCCATCAGCCCAACAAGACCAATC 

5701 + + + + + + 5760 

DPVEWVRDTLPWPSAQQDQS 

AAAGCTGTACCACCTGCCCCCACCCACCGTGGGCCCTCACAGCATTGCCTCACCTCCCGA 

5761 + + + + + + 5820 

KLYHLPPPTVGPHS IASPPE- 

GGATAGGACAGTCAAAGACAGCACCCCAAGTTCTCTGGACTCAGATCCTCtGATGGCCAT 

5821 + + + + + + 5880 

DRTVKDSTPSSLDSDPLMAM- 

GCTGCTGAAACTTCAAGAAGCTGCCAACTACATTGAGTCTCCAGATCGAGAAACCATCCT 

5881 + + + + + " + 5940 

LLKLQEAANYIESPDRETIL- 

GGACCCCAACCTTCAQGCAACACTTTAAGGGTTCGGCAATCACTGTCACCCCCGGACAGC 

5941 + + + + + + 6000 

DPNLQATL* 

AGAACGCTGGCATCAGCTATCTTAGCTCCTCCTCTCCCCTCTCCTCTTTCAGAGCACTGG 
6001 + + + " + + + 6060 

CTCTCCAGCCCCAGGAGGAGAACAGGAGGGAGGAGGAGATGAAAGAGGAGGGACAGGTTC 
6061- + + + + + + 6120 

TTGGTGCTGTACCTTTGAGAACTTCCTAGGAAGGAATGGTGGGGTGGCGTTTGGGAACTT 
6121 + + + + + + 6180 

GTGCCCCCTAAACACATTTACTGGCCTCCTCTAATGACTTTGGGGAAAAGATGATTCTGG 
6181 + + + + + + 6240 

GTCTTTCCCTTGACTTCTTGTTTCAATTACAAACTCCTGGGCTTTCTGGGGAGGGGTTCA 
6241 + - + + + + + 6300 



Fig. 16 (cont'd 6) 
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GAAAACATCAAAACACTGCAGCAGTTCCTAAATGATTCTCACAAGCAACCCTGAGAGAGA 
6301 + + + + + + 6360 

CAGTCTTGTGAGGGAGATCTGGGGGAGGCAGGAAGCTCCTCAGATTTTCTCACAGACCCT 
6361 + + + + + + 6420 

TCCCAATTCCATCACCACTGCCAACAACTCCTCCCCCAGAGATCTGGCTGGAGCCCAGAA 
6421 + + + + + + 6480 

AAAG AAG C ATGTGGTTT AAAAAATGTTT AAAT C AATCTGT AAAAGGT AAAAATG AAAAAC 
6481 + + + + + + 6540 

AAAAACAAGCAAACAAACAAAAAACAATGGAAAAGATGAAGCTGGAGAGAGAGGAACCAG 
6541 + + + + + 1 + 6600 

TTGCCAAGGTAGAGAGCTGCCCGCTCCTGCCCTCTGGATGACATAGGGGACATCAACAAG 
6601 + + + + + + 6660 

ACGGCTGCCAACCTGAGAAGTCACCAAACCACAAAAATAACCTTACAGCCTTCAGGGAAA 
6661 + + + + + + 6720 

GACTACCAGCTCTGTCTTTCTACCCTCTAATTTAACAATGCATAAGAGTCAATAAACCCT 
6721 + + - + + + + 6780 

ACTTTTTTAAAAAAAAAAAAAAAAG 
6781 + + 6805 



Fig. 16 (cont'd 7) 
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T3-CDNA sequence and T3 protein 
encoded therein (protein isoform 1) 



CAACCAGCCAGAACGCCTGAACTCGCAGGTGCTGCAGGGGCTGCAGGAGCCAGCGGGGGA 

1 + + + + + + 60 

NQPERLNSQVLQGLQEPAGE - 

GGGGCTCCCGCTGCGGAAGAGCGGCTCGGTGGAAAACGGGTTCGATACCCAGATCTACAC 

61 + + + + + + 120 

GliPLRKSGSVENGFDTQIYT- 

AGACTGGGCCAATCATTACCTAGCCAAATCCGGCCACAAGCGTCTCATCAGGGATCTCCA 

121 + + + + + + 180 

DWANHYLAKSGHKRL IRDLQ- 

GCAAGATGTGACAGATGGCGTCCTCCTGGCCCAGATTATCCAGGTTGTGGCAAATGAAAA 

181 + + + + + + 240 

QDVTDGVLLAQI IQVVANEK- 

GATTGAAGACATCAATGGCTGTCCGAAGAACAGATCCCAAATGATTGAAAACATAGATGC 

241 + + + + + + 300 

IED INGCPKNR SQM I ENIDA- 

CTGCTTGAATTTCCTGGCAGCTAAGGGAATAAACATCCAGGGGCTGTCTGCAGAAGAGAT 

301 + + + + + + 360 

CLNFLAAKGINIQGLSAEEI- 

CAGGAATGGAAACCTCAAGGCCATTCTAGGCCTCTTCTTCAGCCTCTCCCGATACAAGCA 

361 + + + + + + 420 

RNGNLKAILGLFFSLSRYKQ- 

GCAGCAGCAGCAGCCCCAGAAGCAGCACCTCTCCTCACCTCTGCCGCCCGCCGTATCCCA 

421 + + + + + + 480 

QQQQPQKQHLSSPLPPAVSQ- 



GGTGGCCGGGGCCCCCTCCCAGTGCCAGGCTGGCACCCCTCAGCAGCAGGTGCCAGTCAC 

+ + + + + + 

VAGAPSQCQAGTPQQQVPVT 



TCCCCAAGCCCCGTGCCAGCCTCACCAGCCAGCGCCACATCAGCAGTCAAAAGCACAAGC 

541 + + - + + + + 600 

PQAPCQPHQPAPHQQS K A Q A ~ 

TGAAATGCAGTCCAGACTTCCAGGTCCTACCGCGAGGGTATCCGCTGCAGGCAGCGAGGC 

60 i + + + + + + 660 

EMQSRLPGPTARVSAAGSEA- 



CAAAACACGCGGAGGGTCAACTACTGCTAACAACCGACGCAGCCAGAGCTTTAACAACTA 

+ + + + + + 

KTRGGSTTANNRRSQS FNNY 

TGATAAATCCAAACCAGTCACCTCCCCACCCCCACCGCCAAGCAGCCACGAGAAAGAGCC 

+ + + + + - + 

DKSKPVTSPPPPPSSHEKEP 



TTTGGCAAGTTCAGCCTCCTCCCACCCCGGAATGAGTGACAATGCACCTGCTTCCTTGGA 

781 + + + + + 840 

LASSASSHPGMSDNAPASlfE_- 

Fig- 17 
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GAGCGGCAGCAGCTCCACCCCTACTAATTGCAGTACCTCCTCGGCCATCCCGCAGCCCGG 

841 + + + + + + 900 

SGSSSTPTNCSTSSAIPQPG- 

TGCAGCCACCAAGCCTTGGCGCAGCAAATCCCTCAGCGTGAAGCACAGTGCCACGGTATC 

901 + + + + + + 960 

AATKPWRS KSLSVKHSATVS 

CATGCTCTCGGTCAAGCCTCCTGGGCCTGAGGCCCCCAGGCCCACACCTGAAGCCATGAA 

961 + + + + + + 1020 

MLSVKP PGPEAPRPTPEAMK- 

GCCGGCCCCCAACAATCAGAAGTCCATGCTGGAAAAGCTGAAACTTTTCAACAGTAAAGG 

1021 + + + + + + 1080 

PAPNNQKSMLEKLKLFNSKG 

GGGCTCAAAGGCAGGTGAGGGGCCGGGGTCCCGGGACACAAGCTGTGAGCGGCTGGAGAC 

1081 + + + + + + 1140 

GSKAGEGPGSRDTS CERLET 

TCTGCCCAGCTTCGAAGAGAGCGAGGAGCTGGAGGCCGCCAGTCGCATGCTCACCACCGT 

1141 + + + + + + 1200 

LPS FEESEEIiEAASRMLTTV- 

GGGCCCTGCTTCCAGCAGCCCCAAGATTGCACTCAAGGGCATTGCCCAGAGGACTTTTAG 

1201 + + + + + + 1260 

GPAS SS PKIALKGIAQRTFS - 

CCGGGCACTGACCAACAAGAAGAGTTCTCTGAAAGGCAATGAGAAAGAGAAGGAGAAACA 

1261 + + + + + + 1320 

RALTNK.KSSLKGNEKEKEK-Q- 

AC AGCGGGAGAAGGATAAGGAGAAAAGCAAGGACCTTGCC AAGAGAG CCT CTGTGACGGA 

1321 + + + + + - r 1 + 1380 

QREKDKEKSKDLAKRASVT E- 

GAGGCTGGACCTCAAGGAGGAGCCAAAAGAAGACCCCAGTGGAGCAGCTGTGCCCGAGAT 

1381 + + " + + + + 1440 

RLDLKEEPKEDPSGAAVPEM 

GCCAAAAAAGTCCTCCAAGATTGCCAGCTTCATCCCCAAAGGGGGGAAGCTCAACAGTGC 

144 1 + + - + + + + 1500 

PKKSSKIASFIPKGGKLNSA- 

CAAGAAGGAGC C C ATGG CC C CTTCC C ACAGTGGAATAC CAAAAC CAGGAATG AAG AGC AT 

150 i + + + + + +. 1560 

KKEPMAPSHSGIPKPGMKSM- 

GCCCGGGAAATCCCCAAGTGCCCCAGCGCCTTCCAAGGAAGGGGAGCGGAGCCGGAGTGG 

1561 ' + + + + + + 1^20 

PGKS PSAPAPSKEGERSRSG 

GAAGCTGAGCTCAGGACTCCCCCAGCAGAAGCCCCAGCTGGACGGGAGACACTCCAGTTC 

1621 + + + + + + 1680 

KLSSGIiPQQKPQLDGRHSSS - 

CTCTTCCAGCCTGGCGTCCTCAGAAGGAAAAGGCCCAGGAGGGACCACCCTGAACCACAG 

1681 + + + + + + 1740 

SSS. LA:SSEGKGPGGTTLNIFS - 

Fig. 17 (cont'd 1) 
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CATCAGCAGCCAGACTGTCAGTGGGTCTGTCGGGACCACCCAGACCACAGGAAGCAATAC 

1741 + + + + + - -+ 1800 

I SS QTVSGSVGTTQTTGSN'T 

CGTCAGTGTTCAGCTACCTCAGCCCCAGCAGCAATACAACCATCCCAACACTGCCACGGT 

1801 + + + + + + I860 

VSVQLPQPQQQYNHPNTATV- 

TGCACCTTTCCTGTACAGGTCTCAGACGGACACTGAAGGGAATGTTACTGCCGAGTCAAG 

1861 + + + + + + 1920 

APFLYRSQTDTEGNVTAESS 

CTCAACAGGTGTGAGCGTGGAGCCCAGCCACTTCACCAAGACTGGACAGCCTGCTCTGGA 

1921 + + + + + + 1980 

STGVSVEPSHFTKTGQPAL.E 

AGAACTCACTGGGGAAGATCCTGAGGCTCGGCGGCTGCGGACAGTGAAGAACATCGCTGA 

1981 + + + + + + 2040 

ELTGEDPEARRLRTVKNIAD 

TCTGCGGCAGAATTTGGAGGAAACCATGTCCAGTTTAAGGGGAACTCAGGTTACACACAG 

2041 + + + + + + 2100 

LRQNLiEETMSSLRGTQVTHS 

CACATTGGAAACCACGTTTGACACCAATGTCACCACGGAGATGAGTGGCCGTAGCATACT 

2101 + + + + " + + 2160 

TLETTFD TNVTTEMSGRS I L 

CAGCTTGACAGGGAGGCCCACACCTCTGTCCTGGAGACTGGGCCAGTCCAGCCCTCGGCT 

2161 + ---- + + ---- + + + 2220 

SLTGRPTPLSWRLGQSSPRL- 

CCAAGCAGGAGACGCCCCCTCAATGGGCAATGGGTATCCCCCTCGAGCCAACGCCAGCAG 

2221 + + + + + ~ + 2280 

QAGDAPSMGNGYPPRANASR - 

GTTCATCAACACTGAGTCAGGTCGCTATGTGTACTCCGCCCCTCTGAGAAGGCAGCTGGC 

2281 + + + + + + 2340 

F1NTESGRYVYSAPLRRQLA- 

CTCCCGGGGCAGTAGTGTCTGCCACGTGGACGTCTCAGACAAGGCAGGAGATGAGATGGA 

2341 + + . + + + + 2400 

SRGSSVCHVDVSDKAGDEMD - 

CCTGGAAGGCATCAGCATGGACGCCCCCGGCTACATGAGCGATGGGGATGTTCTGAGCAA 

2401 + + + + + + 2460 

LEGISMDAPGYMSDGDVLSK- 

GAACATCCGGACCGATGACATTACAAGCGGATACATGACTGATGGTGGACTTGGCCTCTA 

2461 * + + + + + + 2520 

NIRTDDITSGYMTDGGLGLY- 



TACCCGTCGCCTGAACCGGCTCCCTGATGGGATGGCTGTGGTACGGGAGACCCTGCAACG 

2521 + + + + + + 

TRRLNRLPDGMAVVRETLQR 



AAATACCTCCCTGGGCCTCGGAGACGCTGACAGCTGGGACGACAGCAGCTCCGTCAGCAG 

+ + - + + + - + 

NTSLGLGDADSWDDSSSVSS 



Fig. 17 > (cont ' d 2 ) 
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CGGCATCAGCGACACCATAGACAACCTCAGCACTGATGACATCAACACCAGCTCCTCCAT 

2641 + + + + + + 2700 

G I SDT IDNLSTDD INT S S S I 

CAGCTCTTATGCCAACACACCTGCCTCCTCTCGAAAAAACCTGGATGTGCAGACTGATGC 

2701 + + + + + + 2760 

SSYANTPASSRKNLDVQTDA- 

TGAGAAGCACTCACAGGTGGAGAGGAATTCCCTGTGGTCTGGTGATGATGTCAAGAAATC 

2761 + + + + + + 2820 

EKHSQVERNSLWSGDDVKKS 

AG ACGGAGG CTCAGACAGCGG CATAAAAATGGAG CCAGGTTC C AAGTGG AGGCGGAAT C C 

2821 + + + + + + 2880 

DGGSDSGIKMEPGSKWRRNP- 

TTCTGATGTGTCTGACGAGTCCGACAAAAGCACGTCGGGCAAGAAGAATCCTGTCATCTC 

2881 + + + + + + 2940 

SDVSDESDKSTSGKKNPVIS - 

CCAGACAGGCTCATGGCGGCGAGGCATGACAGCTCAGGTGGGCATCACCATGCCAAGGAC 

2941 + + + + + + 3000 

QTGSWRRGMTAQVG ITMPRT 

GAAGGCTTCAGCCCCGGCAGGCGCACTGAAGACCCCAGGAACTGGAAAAACAGACGACGC 

3001 _ + + + + + + 3060 

KASAPAGALKTPGTGKTDDA- 

AAAGGTGTCTGAGAAAGGAAGGCTTTCTCCTAAAGCCTCCCAGGTGAAGCGCTCCCCATC 

3061 + + + + 4- + 3120 

KVSEKGRLSPKASQVKRSPS - 

AGATGCAGGCCGGAGCAGTGGTGACGAATCCAAAAAGCCCCTCCCCAGCAGCTCTAGGAC 

3121 + + + + + + 3180 

DAGRSSGDESKKPLPSSSRT- 



ACCTACTGCCAATGCCAACAGCTTTGGGTTCAAGAAGCAGAGTGGTTCCGCCACCGGCCT 

h + + + + + 

P-TANANSFGFKKQSGSATGL 
GGCCATGATCACAGCCAGCGGGGTGACTGTCACCAGCAGGTCAGCCACACTGGGCAAAAT 

h + ^ + h H + 

AMITASGVTVTSRSATLGKI 

CCCAAAGTCATCTGCACTCGTCAGTCGGTCTGCTGGTCGGAAGTCAAGTATGGATGGGGC 

+ + + + + + 

PKS SALVSRSAGRKS SMDGA 

TCAGAATCAGGATGACGGGTATCTAGCCCTAAGCTCCCGGACAAACCTTCAGTACCGGAG 

* h + + + + + 

QNQDDGYLALSSRTNLQYRS 

TTTGCCGAGGCCCAGTAAGTCCAACAGCCGGAACGGGGCTGGGAACAGGTCTAGCACCAG 

+ + + + + + 

LPRPSKSNSRNGAGNRSSTS 



CAGCATAGATTCCAACATTAGCAGCAAGTCCGCAGGCCTGCCAGTGCCCAAACTGAGGGA 

3481 + + + + + + 3540 

SIDSNISSKSAGLPVPKLI?E- 

Fig. 17 .(cont'd 3) 
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GCCTTCCAAAACAGCCCTAGGCAGCTCTCTACCAGGTCTGGTCAACCAAACAGACAAGGA 
3541 + + + + + + 3600 

psktalgsslpglvnqtdk'e- 
gaaaggcatctcatcagacaacgagagtgtggcttcctgtaactcggtgaaagtgaatcc 

3601 + + + + + + 3660 

KGISSDNESVASCNSVKVNP- 

GGCAGCCCAGCCTGTGTCCAGTCCGGCTCAGACCAGTCTCCAGCCTGGAGCCAAGTACCC 

3661 + + + + + + 3720 

AAQPVSSPAQTSLQPGAKYP- 

AGATGTGGCCTCTCCCACACTCCGCAGACTCTTTGGTGGGAAGCCTACCAAGCAAGTGCC 

3721 + + + + + + 3780 

DVAS PTLiRRLFGGKPTKQVP 

CATCGCCACAGCTGAAAACATGAAAAATTCGGTGGTCATCTCCAATCCTCATGCCACCAT 

3781 + + + + + + 3840 

IATAENMKNSVVI SNP HATM - 

GACTCAGCAAGGTAACCTAGACTCCCCGTCAGGCAGTGGCGTCCTGAGCAGTGGGAGCAG 

3841 + + + + + + 3900 

TQQGNLDSPSGSGVLSSGSS- 

CAGTCCTCTCTACAGCAAGAATGTGGACCTCAACCAGTCTCCGCTAGCCTCCAGCCCCAG 

3901 + + + + + + 3960 

SPLYSKNVDLNQSPLASSPS- 

CTCAGCCCACTCGGCCCCTTCCAACAGCCTCACCTGGGGCACCAACGCCAGCAGCTCCTC 

3961 + + + + + + 4020 

SAHSAPSNSLTWGTNASSSS- 

CGCAGTTAGCAAGGATGGCCTGGGCTTTCAGTCTGTCAGCAGCCTCCACACCAGCTGTGA 

4021 + + + + + " + 4080 

AVSKDGL.GFQSVSSLHTSCE- 

GTCCATCGACATCTCCCTCAGCAGTGGAGGGGTGCCCAGCCACAATTCTTCCACTGGCCT 

4081 + + + + + + 4140 

SIDISLSSGGVPSHN-SSTGL- 

CATCGCCTCCTCCAAGGACGACTCCTTGACTCCCTTTGTCAGAACTAACAGTGTGAAGAC 

4141 + + - + + + + 4200 

IASSKDDSLTPFVRTNSVKT- 

CACACTGTCAGAAAGCCCTCTCTCTTCCCCTGCTGCTAGCCCTAAGTTCTGCAGAAGTAC 

4201 + + + + + + 4260 

TLSESPLSSPA ASPKFCRST- 

TCTGCCCAGGAAACAGGACAGTGACCCGCACCTTGATAGGAACACTTTGCCTAAGAAAGG 

4261- + + + + + + 4320 

LiPRKQDSDPHLDRNTLiPKKG- 

ACTCAGGTATACTCCCACCTCCCAGCTTCGCACGCAAGAAGATGCAAAAGAATGGTTACG 

4321 + + + + + + 4380 

LRYTPTSQLRTQEDAKEWLR- 

GTCCCATTCTGCAGGAGGCCTTCAGGACACCGCTGCCAATTCCCCCTTTTCCTCTGGCTC 

4381 + + + + + " + 4440 

SHSAGGLQDTAANSPFS SO^S - 



Fig. 17 (cont'd 4) 
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CAGCGTGACTTCTCCCTCCGGAACAAGATTCAACTTTTCCCAGCTTGCGAGTCCCACCAC 

4441 + + + + + + 4500 

SVTS PSGTRFNFSQLAS P T " T 

TGTCACCCAGATGAGCTTGTCCAACCCGACCATGCTGAGGACTCACAGCCTCTCCAATGC 

4501 + + + + + + 4560 

VTQMSliSNPTMLRTHSLSNA- 

TGATGGGCAGTATGATCCATACACTGACAGCCGCTTCCGGAATAGCTCCATGTCCCTGGA 

4561 + + + + + + 4620 

DGQYDPYTDSRFRNSSMSLD- 

TGAGAAGAGCAGAACCATGAGCCGTTCAGGCTCATTCCGGGATGGGTTTGAAGAAGTTCA 

4621 + + + + + + 4680 

EKSRTMSRSGSFRDGFEEVH 

TGGATCCTCACTCTCCTTGGTTTCCAGCACATCGTCAGTTTATTCTACACCAGAAGAAAA 

4681 + + + + + - + 4740 

GSSLSLVSSTSSVYSTPEEK- 

ATGCCAGTCAGAGATTCGCAAGCTGCGGCGGGAACTGGATGCCTCCCAGGAGAAAGTTTC 

4741 + + + + + + 4800 

CQSE IRKLRRELDASQEKVS 

AGCTTTGACCACCCAGCTGACAGCAAATGCTCACCTTGTGGCTGCCTTTGAACAGAGTCT 

4801 + + + + + + 4860 

ALTT QLTANAHLVAAF E Q S L. 

TGGTAACATGACAATCAGGCTC C AGAGTCTGACC ATGACAGCTGAG C AGAAGGATTCAGA 

4861 + + + + + ' + 4920 

GNMT IRLQSLTMTAEQKDSE 

ACTGAATGAGTTAAGAAAAACCATTGAGCTGCTAAAGAAACAGAACGCAGCTGCCCAGGC 

4921 + + + + + --: + 4980 

LNELRKTIELLKKQNA AAQA- 

TGC C ATTAATGGAGTAATTAACACAC CTGAGCTCAACTGC AAAGGAAACGGC ACTGCCC A 

4981 + + + + + + 5040 

AINGVINTPELNCKGNGTAQ- 

GTCTGCAGACCTCCGCATCCGCAGGCAGCACTCCTCAGACAGCGTCTCCAGCATCAACAG 

5041 + + ~ + + + + 5100 

SADLRIRRQHSSDSVSSINS - 

TGCCACCAGCCACTCCAGTGTGGGCAGCAACATAGAGAGTGACTCAAAGAAGAAGAAGAG 

5101 + - + + + + + 5160 

ATSHSSVGSNIESDSKKKKR- 

GAAGAACTGGGTCAATGAGTTACGCAGCTCCTTCAAGCAAGCTTTCGGGAAGAAGAAGTC 

5161- + + + + + + 5220 

KNWVNELRSSFKQAFGKKKS 

CCCAAAATCTGCGTCCTCTCATTCAGATATTGAGGAGATGACGGATTCTTCTTTGCCTTC 

5221 + + + + + + 5280 

PKSASSHSDIEEMTDSSLPS 

CTCACCAAAGTTACCACACAATGGGTCCACAGGTTCCACCCCACTGCTGAGGAATTCTCA 

5281 + + + + + + 5340 

SPKLPHNGSTGSTPLLRNSH- 



Fig. 17 (cont'd 5) 
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CTCCAACTCTCTAATTTCAGAATGCATGGATAGTGAAGCTGAGACCGTCATGCAGCTCCG 

5341 + + + + + + 5400 

SNSIjISECMDS EAETVMQ L R 

AAATGAGTTAAGAGACAAGGAGATGAAGCTGACAGATATCCGCTTAGAAGCTCTCAGTTC 

5401 + + + + + + 5460 

NELRDKEMKLTDIRLEALSS 

TGCCCACCAGCTGGACCAGCTCCGGGAGGCCATGAACAGGATGCAGAGTGAAATAGAGAA 

5461 + + + + + + 5520 

AHQLDQLREAMNRMQSEIEK- 

GCTGAAAGCTGAGAATGATCGGCTGAAGTCAGAGTCTCAAGGCAGTGGCTGCAGCCGGGC 

5521 + + + + + + 5580 

LKAENDRL.KSESQGSGCSRA- 

TCCTTCCCAAGTGTCCATCTCTGCCTCCCCGAGGCAGTCCATGGGCCTCTCCCAGCACAG 

5581 + + + + + + 5640 

PSQVSISASPRQSMG-LSQHS- 

CTTGAACCTCACTGAGTCAACCAGCCTGGACATGTTGCTGGATGACACTGGTGAATGCTC 

5641 + + + + + + 5700 

LiNLiTE STS LDMLLDDTGECS 

GGCTCGGAAGGAAGGAGGCAGGCATGTTAAGATAGTTGTCAGCTTTCAGGAGGAAATGAA 

5701 + + + + + + 5760 

ARKEGGRHVKIVVS FQE EMK- 

GTGGAAGGAGGATTCCAGACCACATCTCTTTCTTATTGGCTGCATTGGAGTTAGTGGCAA 

5761 + + + + + + 5820 

WKEDSRPHLFLIGCIGVSGK- 

GACGAAGTGGGATGTGCTCGATGGGGTGGTTAGACGGCTGTTCAAAGAATACATCATTCA 

5821 + + + + + -; + 5880 

TKWDVLDGVVRRLFKEYIIH- 

TGTCGACCCAGTGAGTCAGCTAGGGCTGAATTCAGACAGCGTTCTTGGCTACAGCATTGG 

5881 + + + + + + 5940 

VDPVSQ LGLNSDSVLGYSIG- 



AGAAATCAAGCGCAGCAACACTTCCGAAACACCGGAGCTGCTTCCTTGTGGCTATCTGGT 

+ h - + + + - - h 

EIKRSNTSETPELLPCGYLV 



TGGAGAGAACACGACCATCTCAGTGACTGTGAAAGGGCTCGCAGAAAACAGCCTGGACTC 

6001 + + + + + + 6060 

GENTTI SVTVKGLAENS LDS 



ACTGGTGTTTGAGTCCTTGATTCCCAAGCCCATCCTGCAGCGCTACGTCTCCCTCCTGAT 

+ + + + + + 

LVFESLIPKPILQRYVSLLI 

AGAGCACCGTCGGATCATTCTCTCTGGCCCCAGCGGCACTGGGAAAACCTACCTGGCCAA 

+ + + + + + 

EHRRI ILSGPSGTGKTYLAN 



CCGGCTGTCTGAGTATATAGTGCTTCGAGAGGGACGGGAGTTGACAGACGGGGTTATCGC 

6181 + + + + + " + 6240 

RLSEYIVLREGRELTDGV^A- 



Fig. 17 (cont'd 6) 
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CACCTTTAACGTGGACCATAAGTCCAGCAAGGAATTGCGCCAGTACCTGTCCAACCTTGC 

6241 + + + + + + 6300 

TFNVDHKSS KELRQYLSNLA- 

TGACCAGTGCAACAGTGAGAACAATGCTGTGGACATGCCCCTCGTCATCATCCTGGACAA 

6301 + + + + + + 6360 

DQCNS ENNAVDMPLVI ILDN- 

CCTACACCACGTGAGCTCTCTGGGCGAGATCTTCAATGGGCTGCTCAACTGCAAGTACCA 

6361 + + + + + + 6420 

GGATGTGGTGCACTCGAGAGACCCGCTCTAGAAGTTACCCGACGAGTTGACGTTCATGGT 
LHHVS SIjGE IFNGLtLNCKYH 

CAAATGCCCTTACATAATTGGCACAATGAACCAGGCTACCTCTTCGACTCCCAACCTGCA 

6421 + + + + + + 6480 

GTTTACGGGAATGTATTAACCGTGTTACTTGGTCCGATGGAGAAGCTGAGGGTTGGACGT 
KCPY I IGtMNQATS STPNLQ 

GCTTCACCATAACTTCAGATGGGTGCTTTGTGCCAACCACACGGAGCCTGTGAAGGGTTT 

6481 + + + + + + 6540 

CGAAGTGGTATTGAAGTCTACCCACGAAACACGGTTGGTGTGCCTCGGACACTTCCCAAA 
LHHNFRWVLCANHTEPVKGF 

CCTTGGCCGATTCCTGAGGAGGAAGCTCATGGAAACAGAGATCAGTGGGCGGGTGCGCAA 

6541 + + + + + + 6600 

GGAACCGGCTAAGGACTCCTCCTTCGAGTACCTTTGTCTCTAGTCACCCGCCCACGCGTT 
LGRFLRRKLMETEISGRVRN- 

TATGGAGCTGGTAAAAATCATTGACTGGATTCCCAAGGTCTGGCATCACCTCAACCGCTT 

6601 + + + + + + 6660 

ATACCTCGACCATTTTTAGTAACTGACCTAAGGGTTCCAGACCGTAGTGGAGTTGGCGAA 
MELVKIIDWIPKVWHHLNRF- 

CCTGGAGGCTCACAGTTCCTCGGACGTCACCATCGGCCCCCGGCTCTTCCTGTCATGCCC 

6661 + + + + + -'" + 6720 

LEAHSSSDVTIGPRLFLSCP- 

CATCGATGTGGACGGCTCGAGAGTGTGGTTCACCGACTTGTGGAACTATTCCATTATCCC 

672 i + + + + + + 6780 

IDVDGSRVWFTDLWNYS I I P 

CTATCTCCTGGAAGCCGTCAGAGAAGGACTCCAGCTCTATGGAAGGCGCGCCCCCTGGGA 

6781 + + + + + + 6840 

YLLEAVREGLQLYGRRAPWE - 

GGATC CTG CC AAGTGGGTGATGGAC AC AT ATCCATGGG CAGC CAGC C C ACAAC AGC ACG A 

6 84 .! + + + + + ---.-. + 6900 

pPAKWVMDTYPWAAS PQQHE 

GTGGCCTCCCCTGCTGCAGTTACGGCCTGAGGATGTCGGCTTCGACGGCTACTCCATGCC 

690 1 + + + + + + 6960 

WPPLLQLRPEDVGFDGYSMP- 

TCGGGAGGGATCGACAAGCAAGCAGATGCCCCCCAGTGATGCTGAAGGTGACCCGCTGAT 

6961 + + + + + + 7020 

REGSTSKQMPPSDAEGDPIiM- 

GAACATGCTGATGAGGCTGCAGGAGGCAGCCAACTACTCCAGCCCCCAGAGCTATGACAG 

7021 + + + + + + 7080 

-N MLMRLQEAANYSSPQSYfi. S - 

Fig. 17 (cont'd 7) 
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CGACTCCAACAGCAACAGCCATCACGATGACATCTTGGACTCCTCTTTGGAGTCCACTCT 

7081 + + + + + + 7140 

DSNSNSHHDDILDSSLESTL- 

GTGACAGGGGCCCGGAGCCCAGCGCCCTCCTCTTCTCCTCACCGCATTCCACCTGCATCC 

7141 + + + + + + 7200 

* - 

CCCACATCACCCTGAAGATGACTTCCTGAGCCAGCCCCCAGCCACAGCCTTAGAGCTGCG 
7201 + + + + + + 7260 

GGAACACCGAGACCCCCCGTCCTTCAGCCTCGACCTGGGTGCAGGCATCCCGGGCCAGCT 
7261 + + + + + -"-+ 7320 

GCCTGCGGACCGCTTCCTTCCACAGCGAGAACTGCACTACCTTCTGTTGTACTTTAATTA 
7321 + + + + + + 7380 

TTGTTTTGCCTTGTTGCTGTGACCTCCCTAAGACACTGAAGATACTTCTCGGGAAAGGAT 
7381 + + + + " + + 7440 

CATCGCCGTTGAAATGAAAAGAGAGACAGAGAGAGAAAAAAAAAAGAGAACCCACATGAA 
7441 -- + + + + + + 7500 

GCTCTGAAACCAAACAGCATCCTGCCATGAGCTTCCCAGAGACAGAAGAGACTGGAGCAA 
7501 + + + + + + 7560 

AGTCGGAAACACAGAGAAGCACGGCTTCCCCTCAGCACAGACCCTCCAGACTGGGTCTCA 
7561 + + + + + + 7620 

GAGCCGTGCCACCCACCCTCCCACACAGCCGGCCACAGGGAGAACTGGTGCTAACCAGGG 
7621 + + + + + + 7680 

TGCTTGCTTTGGTCACGTTCAACGCACTACAGAGCTACGACACAGGGGAACCTTAGGAGC 
7681 + + + + + - + 7740 

AAATAAACCGTGCTTTCATGTTTTTTAAAAAAAAAAAAAAAAA 
7741 + + + + 7783 



Fig. 17 (cont'd 8) 
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T3-CDNA sequence and T3 protein encoded 
therein (isoform 2) 



AGCAGGGAGAGGGGAGGGAGTGTGCCGTCTCTTCTGCAAGGGCAGTGCCCCAGCCTCAGC 

! + + + + + + 60 

SRERGGSVPSLLQGQCPSLS 

CACACTTCTGATCTGCAGTCCAACAGACCTTTCTAGCATGCCAAAGAGAACCTGGGGGTG 

61 + + + + + — -+ 120 

HTSDLQSNRPF*HAKENLiGV 



181 



CCAGGGGGTCCTCAGAGCTCACACTGCACTTGTGGCACCCACAGCGAGTAGCCATCCGTG 

+ + + + + + 

PGGPQSSHCTCGTHSE*PSV 

AGC CGAGGAAACTGTACACAGATCTACAC AGACTGGGC CAATC ATTAC CTAG C C AAATCC 

SRGNCTQIYTDWANHYLAKS 



240 



GGCCACAAGCGTCTCATCAAGGATCTCCAGCAAGATGTGACAGATGGCGTCCTCCTGGCC 

24i + + + + + + 300 

GHKRLIKDLQQDVTDGVL.LA 



CAGATTATCCAGGTTGTGGCAAATGAAAAGATTGAAGACATCAATGGCTGTCCGAAGAAC 

1 + + + + + + 

QIIQVVANEKIEDINGCPKN 

AGATCCCAAATGATTGAAAACATAGATGCCTGCTTGAATTTCCTGGCAGCTAAGGGAATA 

+ , + + + + + 

RSQMIENIDACLNFLAA KGI 

AACATCCAGGGGCTGTCTGCAGAAGAGATCAGGAATGGAAACCTCAAGGCCATTCTAGGC 

+ + |- -| H + 

NIQGLSAEEIRNGNLKAILG 



CTCTTCTTCAGCCTCTCCCGATACAAGCAGCAGCAGCAGCAGCCCCAGAAGCAGCACCTC 

481 + + + + + + 540 

LFFSLSRYKQQQQQPQKQHL 



TCCTCACCTCTGCCGCCCGCCGTATCCCAGGTGGCCGGGGCCCCCTCCCAGTGCCAGGCT 

+ + + H + - + 

SSPLPPAVSQVAGAPSQC Q A 

GGCACCCCTCAGCAGCAGGTGCCAGTCACTCCCCAAGCCCCGTGCCAGCCTCACCAGCCA 

+ h + + k + 

GTPQQQVPVTPQAPCQPHQP 



Fig. 18 
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T3 murine cDNA 



ATGAGAAGAGCCGAACAATGAGTCGGTCAGGCTCCTTCCGGGATGGGTTTGAGGAAGTTC 

1 + + + + + + 60 

EKS RTMSRSGS FRDGFEEVH- 

ATGGATCCTCCCTGTCCTTGGTTTCCAGCACATCCTCCATCTACTCCACGCCAGAAGAAA 

61 + + + + + + 120 

GSSLSLVSSTSSIYSTPEEK- 



AATGCCAGTCAGAGATTCGAAAGCTGAGGCGAGAACTGGATGCCTCCCAGGAAAAGGTGT 



+ + + + + + 180 

CQ S E IRK LRRE LD AS Q E KVS- 

AATGCCAGTCAGAGATTCGAAAGCTGAGGCGAGACGTGGATGCCTCCCAGGAAAAGGTGT 

+ + + + + + 180 

CQSEIRKLRRDVDASQEKVS- 

CTGCGCTGACTACCCAGCTGACTGCAAATGCTCACCTTGTGGCAGCCTTCGAGCAGAGTC 

+ + + + + + 240 

ALT TQLTANAHLVAAFE Q S L - 



121 



TGGGAAACATGACCATCAGGCTACAGAGTTTAACTATGACCGCTGAGCAGAAGGATTCAG 

241 + + + + + + 300 

GNMT IRLQS LTMTAEQKD S E - 



AACTGAACGAGTTAAGAAAAACCATCGAGCTGCTGAAGAAACAGAATGCAGCTGCCCAGG 

+ + + + + + 3( 

LiN E LRKT I E L L KKQNAAAQA- 

CTGCCATTAATGGAGTGATTAACACGCCAGAGCTCAACTGCAAAGGAAATGGCAGTGCCA 

+ + + + + + 4: 

A I N GV I NT P E L NC KGNG S AR- 



GGCTACAGACCTACGCATCCGCAGCAACACTCCTCCGACAGTGTCTCCAGTATCAATAGC 

421 + + + + + + 480 

LQTYASAATLLRQCLQYQ *R- 

GCCACCAGCCACTCAAGTGTG 

481 + + - 501 

H Q P L K C 



Fig # 19 
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CCTCCTCCACCTACTCCTCAC 
AS S T Y S S 



AGAAGAAAAAAAAGAGTTGGC 
K K K K S W 



AGCCTCTCCAACCGCTCGTAC 
S L S N R S Y 

GGCTCTCCAACCGCTCGTACC 
S L S N R S Y 

CCTCTCCAACCGCTCGTACCC 
L S N R S Y 

CTCTCCAACCGCTCGTACCCT 
L. S N R S Y P 

TCTCCAACCGCTCGTACCCTC 
L S N R S Y P 

CTCCAACCGCTCGTACCCTCT 
S N R S Y P 

TCCAACCGCTCGTACCCTCTG 
S N R S Y P L 

CCAACCGCTCGTACCCTCTGT 
S N R S Y P L 

CAACCGCTCGTACCCTCTGTC 
N R S Y P Ii 

AACCGCT CGT ACCCTCTGTCA 
N R S Y P L S 

AC CGCT CGT AC C CT CTGT CAT 
N R S Y P L S 

CCGCTCGTACCCTCTGTCATG 
R S Y P L S 

CGCT CGT ACCCTCTGTCATGG 
R S Y P L S W 

GCTCGTACCCTCTGTCATGGC 
S Y P L S W 

CTCGTACCCTCTGTCATGGCG 
S Y P L S W 

TCGTACCCTCTGTCATGGCGC 
S Y P Ii S W R 

CGTAC CCT CT GT CAT GGC GCT 
S Y P L S W R 



GTACCCTCTGTCATGGCGCTA 
Y P L S W R 



TACCCTCTGTCATGGCGCTAT 
Y P L S W R Y 



CTCCTCCACCTACTCCTCACA 
S S T Y S S 

TCCTCCACCTACTCCTCACAA 
S S T Y S S Q 

CCTCCACCTACTCCTCACAAA 
S S T Y S S Q 

CT CC ACCT ACT C CT C AC AAAT 
S T Y S S Q 

TCCACCTACTCCTCACAAATC 
S T Y S S Q I 

CC AC CT ACT C CT CAC AAAT C C 
S T Y S S Q I 

CACCT ACT CCT CACAAAT C CG 
T Y S S Q.I 

AC CT ACT C CT CACAAAT CCGG 
T Y S S Q I R 

CCTACTCCTCACAAATCCGGA 
T Y S S Q I R 

CT ACT C CT CACAAATC C GGAA 
Y S S Q I R 

TACT CCT CACAAATC CGGAAG 
Y S S Q I R K 

ACT C CT CACAAAT C CGG AAGC 
Y S S Q I R K 

CTCCTCACAAATCCGGAAGCT 
S S Q I R K 

T C CT CACAAAT CCGGAAGCTT 
S S Q I R K L 

CCTCACAAATCCGGAAGCTTC 
S S Q I R K L 

CTCACAAATCCGGAAGCTTCG 
S Q I R K L 

T CACAAAT CCGG AAGCTT CGT 
S Q I R K L R 

CACAAAT C CGGAAGCTT CGT A 
S Q I R K L R 

ACAAAT CCGGAAGCTT CGT AG 
Q I R K L R 



GAAGAAAAAAAAGAGTTGGCT 
K K K K S W 

AAGAAAAAAAAGAGTTGGCTT 
K K K K S W L 

AGAAAAAAAAGAGTTGGCTTC 
K K K K S W L 

GAAAAAAAAGAGTTGGCTTCG 
K K K S W L 

AAAAAAAAGAGTTGGCTT CGA 
K K K S W L R 

AAAAAAAGAGTTGGCTTCGAA 
K K K S W L R 

AAAAAAGAGTTGGCTTCGAAG 
K K S W L R 

AAAAAGAGTTGGCTTCGAAGT 
K K S W L R S 

AAAAGAGTTGGCTTCGAAGTT 
K K S W I* R S 

AAAGAGTTGGCTTCGAAGTTC 
K S W L R S 

AAGAGTTGGCTTCGAAGTTCC 
K S W L R S S 

AGAGTTGGCTTCGAAGTTCCT 
K S W L R S S 

GAGTTGGCTTCGAAGTTCCTT 
S W L R S S 

AGTTGGCTTCGAAGTTCCTTC 
S W L R S S F 

GTTGGCTTCGAAGTTCCTTCA 
S W L R S S F 

TTGGCTTCGAAGTTCCTTCAA 
W L R S S F 

TGGCTTCGAAGTTCCTTCAAC 
W L R S S F N 

GGCTTCGAAGTTCCTTCAACA 
W L R S S F N 

GCTTCGAAGTTCCTTCAACAA 
L R S S F N 



ACCCTCTGTCATGGCGCTATG 
Y P L S W R Y 



Fig. 20 
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CTCCATCAAGTCCTCCACCTC 
S I K S S T 



AGTTGGAGGTGGACCTGCTGG 
L, E V D L L. 



ATG AC AC C CAAC C ATGGCTTT 
M T P N H G F 



TC C AT C AAGT C CT C C AC CT CG 
S I K S S T S 



GTTGGAGGTGGACCTGCTGGA 
L E V D t« L 



TGACACCCAACGATGGCTTTC 
M T P N H G F 



CCATC AAGT CCT C C AC CT CGT 
S I K S S T S 



TTGGAGGTGGACCTGCTGGAA 
L, E V D L, L E 



GAC AC C CAAC CAT GGCTTTCA 
T P N H G F 



CAT C AAGT CCT CC AC CT CGT C 
I K S S T S 



TGGAGGTGGACCTGCTGGAAG 
h E V D L, L E 



ACACCCAACCATGGCTTTCAC 
T P N H G F H 



AT CAAGT C CT CC AC CT C GT C C 
I K S S T S S 



GGAGGTGGACCTGCTGGAAGC 
E V D L I* E 



CACCCAACCATGGCTTTCACT 
T P N H G F H 



TCAAGTCCTCCACCTCGTCCT 
I K S S T S S 



GAGGTGGACCTGCTGGAAGCA 
E V D L L E A 



AC C CAAC CAT GGCT TT C ACTT 
P N H G F H 



CAAGTCCTCCACCTCGTCCTC 
K S S T S S 



AGGT GGACCTGCTGGAAGCAG 
E V D L L E A 



CCC AACCATGGCTTTCACTTG 
P N H G F H L 



AAGTCCTCCACCTCGTCCTCC 
K S S T S S S 



GGTGG AC CT GCTGG AAGCAG A 
V D L Li E A 



CCAACCATGGCTTTCACTTGA 
P N H G F H Lt 



AGTCCTCCACCTCGTCCTCCG 
K S S T S S S 



GTGGAC CT G CTGGAAGCAG AG 
V D L L E A E 



CAACCATGGCTTTCACTTGAG 
N H G F H I* 



GTCCTCCACCTCGTCCTCCGT 
S S T S S S 



TGGACCTGCTGGAAGCAGAGA 
V D L L E A E 



AACCATGGCTTTCACTTGAGC 
N H G F H L S 



TCCTCCACCTCGTCCTCCGTG 
S S T S S S V 



GGACCT GCTGG AAGCAGAGAA 
D L Ii E A E 



ACCATGGCTTTCACTTGAGCT 
N H G F H Zi S 



CCTCCACCTCGTCCTCCGTGG 
S S T S S S V 



GACCTGCTGGAAGCAGAGAAT 
D L L E A E N 



CCATGGCTTTCACTTGAGCTT 
H G F H L S 



CTCCACCTCGTCCTCCGTGGG 
S T S S S V 



ACCTGCTGGAAGCAGAGAATG 
D L L E A E N 



CATGGCTTTCACTTGAGCTTC 
H G F H L S F 



TCCACCTCGTCCTCCGTGGGC 
S T S S S V G 



CCTGCTGG AAG CAGAG AAT GA 
L Li E A E N 



ATGGCTTTCACTTGAGCTTCA 
H G F H L, S F 



CCACCTCGTCCTCCGTGGGCA 
T S S S V G 



CT GCT GGAAGCAGAG AATGAC 
Li Li E A E N D 



TGGCTTT CACTTGAGCTTCAG 
G F H L S F 



C ACCT CGT C CT C CGT GGGC AC 
T S S S V G 



TGCTGGAAGCAG AGAAT GACC 
L L E A E N D 



GGCTTTCACTTGAGCTTCAGG 
G F H L S F R 



ACCTCGTCCTCCGTGGGCACT 
T S S S V G G 



GCTGGAAGCAGAGAATGACCG 
L. E A E N D 



GCTTTCACTTGAGCTTCAGGA 
G F H L S F R 



CCT CGTC CT C CGTGGGC ACT G 
T S S S V G G 



CT GG AAGCAGAGAAT GAC CGA 
L E A E N D R 



CTTT CACTTGAGCTT CAGGAT 
F H L, S F R 



CTCGTCCTCCGTGGGCACTGA 
S S S V G G 



T GG AAGCAG AGAATG AC CGAC 
L E A E N D R 



TTTCACTTGAGCTTCAGGATG 
F H Li S F R M 



T CGT C CT C C GT GGGC ACTG AT 
S S S V G G T 



GG AAG CAGAGAATG ACCG ACT 
E A E N D R 



TTCACTTGAGCTTCAGGATGT 
F H L S F R M 



CGTCCTCCGTGGGCACTGATG 
S S S V G G T 



G AAGCAG AGAATG ACCGACT G 
E A E N D R I* 



T CACTTGAGCTT CAGGAT GTT 
H L S F R M 



Fig. 20 (cont'd 1) 



T2 



TAAAAGGTAAAAATGAAAAAC 
AAAAGGTAAAAATGAAAAACA 
AAAGGTAAAAATGAAAAACAA 

AAGGTAAAAATGAAAAACAAA 99/124 

AGGTAAAAATGAAAAACAAAA 

G GT AAAAAT GAAAAAC AAAAA 

GT AAAAAT G AAAAAC AAAAAC 

T AAAAAT G AAAAAC AAAAAC A 

AAAAATGAAAAACAAAAACAA 

AAAATGAAAAACAAAAACAAG 

AAAT GAAAAACAAAAAC AAG C 

AATGAAAAACAAAAACAAGCA 

ATGAAAAACAAAAACAAGCAA 

T GAAAAAC AAAAAC AAG C AAA 

GAAAAACAAAAACAAGCAAAC 

AAAAACAAAAACAAGCAAACA 

AAAACAAAAACAAGCAAACAA 

AAAC AAAAAC AAG C AAA C AAA 

AACAAAAACAAGCAAACAAAC 

ACAAAAACAAGCAAACAAACA 



T2 

:CTCTAATTTAACAATGCATAAGAGTCAATAAACCCTACTTTTTTAAAAAAAA 
C CT CT AATTT AACAAT GC AT AAGAGT C AAT AAAC CCT ACTTTTT T AAAAAAAA 
CTCTAATTTAACAATGCATAAGAGTCAATAAACCCTACTTTTTTAAAAAAAA 
TCTAATTTAACAATGCATAAGAGTCAATAAACCCTACTTTTTTAAAAAAAA 
CTAATTTAACAATGCATAAGAGTCAATAAACCCTACTTTTTTAAAAAAAA 
TAATTTAACAATGCATAAGAGTCAATAAACCCTACTTTTTTAAAAAAAA 
AATTTAACAATGCATAAGAGTCAATAAACCCTACTTTTTTAAAAAAAA 
ATTTAACAATGCATAAGAGTCAATAAACCCTACTTTTTTAAAAAAAA 
T TTAACAAT GCATAAGAGT C AAT AAAC CCT ACTTTTTT AAAAAAAA 
T T AACAAT GC AT AAG AGT C AAT AAAC C CTACTT TTTT AAAAAAAA 
TAACAATGCAT AAGAGT CAATAAACCCT ACTTTTTT AAAAAAAA 
AACAAT GCATAAGAGT C AAT AAAC C CTACTT TTTT AAAAAAAA 
ACAATGCATAAGAGTCAATAAACCCTACTTTTTTAAAAAAAA 
CAATGC AT AAGAGT CAATAAACCCT ACTTTTTT AAAAAAAA 
AAT GCATAAGAGT CAATAAACCCT ACTTTTTT AAAAAAAA 
ATGCATAAGAGTCAATAAACC CTACTT TTTT AAAAAAAA 
TGCATAAGAGTCAATAAACCCTACTTTTTTAAAAAAAA 
GCATAAGAGTCAATAAACC CTACTT TTTT AAAAAAAA 
CATAAGAGTCAATAAACCCTACTTTTTTAAAAAAAA 
AT AAGAGT CAAT AAAC C CTACTTT TTT AAAAAAAA 
TAAGAGTCAATAAACCCTACTTTTTTAAAAAAAA 



Fig. 20 (cont'd 2) 
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T3 

ACTGGGCCAATCATTACCTAG 
W A N H Y L. 

CTGGGCCAATCATTACCTAGC 
W A N H Y L 

TGGGCCAATCATTACCTAGCC 
W A N H Y 1m A 

GGGC C AAT CAT T AC CT AGCC A 
W A N H Y L A 

GG CC AAT C ATT AC CT AGCC AA 
A N H Y L A 

GC C AAT CATT AC CT AGC C AAA 
A N H Y L A K 

CCAAT CATT AC CT AGCC AAAT 
A N H Y L A K 



T3 



C GGCCAC AAGCGT CT CAT C AG 
G H K R L I 



GGCCACAAGCGTCTCATCAGG 
G H K R L I R 



GCCACAAGCGTCTCATCAGGG 
G H K R L I R 



CCACAAGCGTCTCATCAGGGA 
H K R Ij I R 



CACAAGCGTCTCATCAGGGAT 
H K R L I R D 



ACAAGCGTCTCATCAGGGATC 
H K R L I R D 



CAAGCGT CTCAT C AGGGAT CT 
K R Ii X R D 



Q J 9 f 9i- :!!, 1L !H5 

T3 



CT G AAAT GC AGT C C AG ACT T C 
E M Q S R L 

TGAAATGCAGTCCAGACTTCC 
E M Q S R L 

G AAAT G CAGT C CAG AC TT C C A 
E M Q S R L P 

AAATGCAGTCCAGACTTCCAG 
E M Q S R Ii P 

AATGCAGTCCAGACTTCCAGG 
M Q S R* L P 

ATGCAGTCCAGACTTCCAGGT 
M Q S R L. P G 



CAATCATTACCTAGCCAAATC 
N H Y Ii A K 



AAT CAT TACCT AGC CAAATC C 
NHYLAKS 



ATCATTACCTAGCCAAATCCG 
NHYLAKS 



TCATTACCTAGCCAAATCCGG 
H Y Ii A K S 



CATTACCTAGCCAAATCCGGC 
H Y Ii A K S G 



ATTACCTAGCCAAATCCGGCC 
H Y I. A K S G 



TTACCTAGCCAAATCCGGCCA 
Y Ii A K S G 



AAGCGT CT CAT C AGGGAT CTC 
K R L I R D L 



AGCGT CTCAT CAGGGAT CTC C 
K R la I R D L 



GCGTCT CATCAGGGATCT C CA 
R L I R D L 



CGTCT CAT C AGGGATCTC CAG 
R L I R D L Q 



GT CT C ATCAGGGAT CT C C AGC 
R L I R D L Q 



T CT CAT CAGGGATCTC CAGCA 
L I R D L Q 



CTCAT CAGGGAT CT CCAGC AA 
L I R D L Q Q 



TGCAGTCCAGACTTCCAGGTC 
M Q S R L P G 



GCAGTCCAGACTTCCAGGTCC 
Q S R L P G 



CAGTCCAGACTTCCAGGTCCT 
Q S R L P G P 



AGTCCAGACTTCCAGGTCCTA 
Q S R It P G P 



GTCCAGACTTCCAGGTCCTAC 
S R I* P G P 



TCCAGACTTCCAGGTCCTACC 
S R Ii P G P T 



TACCT AGCCAAAT C CGGCC AC 
Y I* A K S G H 



ACCTAGCCAAATCCGGCCACA 
Y Ii A K S G H 



CCTAGCCAAATCCGGCCACAA 
Hi A K S G H 



CTAGCCAAATC CGGCC AC AAG 
I* A K S G H K 



T AGCCAAAT CCGGC C AC AAGC 
L A K S G H K 



AGCCAAATCCGGCCACAAGCG 
A K S G H K 



GCC AAAT CCGGCCACAAGCGT 
A K S G H K R 



T CATCAGGGATCT CCAGC AAG 
L I R D L Q Q 

CATCAGGGATCTCCAGCAAGA 
I R D L, Q Q 

AT CAGGGAT CT C CAGCAAGAT 
I R D L Q Q D 

T C AGGGATCT C CAGCAAGATG 
I R D L Q Q D 

CAGGGAT CT CCAGCAAG ATGT 
R D L Q Q D 

AGGGAT CT C CAGCAAGAT GTG 
R D L Q Q D V 

GGG AT CT C CAGCAAGAT GTGA 
R D I. Q Q D V 



CC AG ACTTCCAGGT CCT AC CG 
S R L P G P T 



CAGACTTCCAGGTCCTACCGC 
R Ii P G P T 



AGACTT CCAGGT CCT AC CGCG 
R Ii P G P T A 



GACTT CCAGGT CCT AC CGCGA 
R L P G P T A 



ACTTCCAGGTCCTACCGCGAG 
L P G P T A 



CTT CCAGGT CCT AC CGCGAGG 
L. P G P T A R 



TTCCAGGTCCTACCGCGAGGG 
L P G P T A R 



Fig. 20 (cont'd 3) 



T C C AGGT CCT AC CGCG AGGGT 
P G P T A R 



CCAGGT CCT ACCGCG AGGGT A 
P G P T A R V 



T3 v ^ 

CGGGGCAGT AGTGT CTGC C AC 
R G S S V C H 

GGGGCAGTAGTGTCTGCCACG 
R G S S V C H 

GGGCAGTAGTGTCTGCCACGT 
G S S V C H 

GGCAGTAGTGTCTGCCACGTG 
G S S V C H V 

GCAGT AGTGT CT G CCACGT GG 
G S S V C H V 

CAGTAGTGTCTGCCACGTGGA 
S S V C H V 

AGTAGTGTCTGCCACGTGGAC 
S S V C H V D 

GT AGTGT CT GC C ACGT GGACG 
S S V C H V D 

a AGTGT CT GCCACGT GGAC GT 
S V C H V D 

AGTGTCTGC CACGTGGACGT C 
S V C H V D V 

GTGT CTGC C ACGT GGAC GTCT 
S V C H V D V 

TGTCTGCCACGTGGACGTCTC 
V C H V D V 



GTCTGCCACGTGGACGTCTCA 
V C H V D V S 



T CT GC CACGTGGACGT CT C AG 
V C H V D V S 



CTGCCACGTGGACGTCTCAGA 
C H V D V S 



TGCCACGTGGACGTCTCAGAC 
C H V D V S D 



GCCACGTGGACGT CT CAGACA 
C H V D V S D 



C CAC GTGGACGT CT C AG AC AA 
H V D V S D 
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T3 



T CACC ATGCC AAGGACG AAGG 
T M P R T K 

CAC C AT GC C AAGG ACGAAGGC 
T M P R T K 

AC C ATGCC AAGG ACGAAGGCT 
T M P R T K A 

C CAT GCC AAGGACG AAGGCTT 
T M P R T K A 

C ATGCC AAGGACG AAGGCTT C 
M P R T K A 

ATGCCAAGGACGAAGGCTTCA 
M P R T K A S 

TGCCAAGGACG AAGGCTT C AG 
M P R T K A S 

GCCAAGGACGAAGGCTTCAGC 
P R T K A S 

CCAAGGACGAAGGCTTCAGCC 
P R T K A S A 

C AAGG ACG AAGG CTTCAGCC C 
P R T K A S A 

AAGGACGAAGGCTT C AGC C C C 
R T K A S A 

AGGACGAAGGCTTCAGCCCCG 
R T K A S A P 

GGACG AAGGCTT CAGC CCCGG 
R T K A S A P 

GACGAAGGCTTCAGCCCCGGC 
T K A S A P 

ACGAAGGCTTCAGCCCCGGCA 
T K A S A P A 

CGAAGGCTTCAGCCCCGGCAG 
T K A S A P A 

GAAGGCTT C AGCCCCGGC AGG 
K A S A P A 

AAGGCTTCAGCCCCGGCAGGC 
K A S A P A G 

AGGCTTCAGCCCCGGCAGGCG 
K A S A P A G 

GGCTTC AGCCC CGGC AGGCGC 
A S A P A G 

GCTT C AGCC CCGGC AGGC GC A 
A S A P A G A 



AGAAGCAGAGTGGTTCCGCCA 
K Q S G S A 

G AAGC AG AGTGGT T C CG CC AC 
K Q S G~ S A 

AAGC AGAGT GGTT C CGCCACC 
K Q S G S A T 

AG C AG AGTGGTT C CGC C AC CG 
K Q S G S A T 

G C AG AGTGGTT C CGC C ACCGG 
Q S G S A T 

CAGAGTGGTTCCGCCACCGGC 
Q S G S **A T G 

AGAGT GGTT C CGCCACCGGC C 
Q S G S A T G 

GAGTGGTTCCGCCACCGGCCT 
S G S A T G 

AGTGGTTCCGCCACCGGCCTG 
S G S A T G L 

GTGGTTCCGCCACCGGCCTGG 
S G S A T G L 

TGGTTCCGCCACCGGCCTGGC 
G S A T G L 

GGTT CC GCC ACCGGCCTGGCC 
G S A T G L A 

GT T CCGC CAC CGGCCTGGC CA 
G S A T G L A 

T T C CGC C ACCGGC CTGGCCAT 
S A T G Ii A 

TTCCGCCACCGGC CTGGCCAT 
S A T G Ii A 

TC CGCCACCGGC CTGGCCAT G 
S A T G L A M 

CC GC C AC CGGCCT GGCCATG A 
S A T G Ii A M 

CGCCACCGGCCTGGCCATGAT 
A T G L A M 

GC CACCGGCCTGGCCATGATC 
AT G L A M I 

CCACCGGCCTGGCCATGATCA 
T G Ii A M I 

CACC GGCCTGGCCATGAT CAC 
T G L A M I 



CACGTGGACGTCT CAG AC AAG 
H V D V S D K 



AC CGGCCTGGC CAT GAT CAC A 
T G L A M I T 



ACGTGGAC GT CT C AGAC AAGG 
H V D V S D K 



CGTGGACGTCTCAGACAAGGC 
V D V S D K 



Fig. 20 (cont f d 4) 



T3 - - c 

GGTCTGGTCAACCAAACAGAC 
G L V N Q T D 

GTCTGGTCAACCAAACAGACA 
G L V M Q T D 



T3 
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TTCATGGATCCTCACTCTCCT 
H G S S I* S 

T C ATGG AT CCT C ACTCTCCTT 
H G S S L< S ^ 



!!"■!! iqi 



l! 3 a„ i! ^s wp^^ 9 



CT CCTT GGT T T C C AGC AC AT C 
S L V S S T 



TCCTTGGTTTCCAGCACATCG 
S L V S T S 



TCTGGT C AAC C AAACAG AC AA 
" L V N Q T D 

C T GGT C AAC C AAACAG AC AAG 
L V N Q T D K 

T GGT C AAC C AAACAG ACAAGG 
L V N Q T D K 



C AT GG AT C CT CACT CT CCTT G 
H G S S L S L 



ATGGATCCTCACTCTCCTTGG 
H G S S I* S L 



TGGATCCTCACTCTCCTTGGT 
G S S L S L 



CCT T GGT TT CC AGC AC ATCGT 
3 I, V S S T S 



CTTGGTTTCC AGC AC ATCGT C 
li V S S T S 



TT GGTT T C C AGCAC AT CGTC A 
L V S S T S S 



GGT C AAC C AAAC AGACAAGG A 
V N Q T D K 

GTGAACCAAACAGACAAGGAG 
V N Q T D K E 



GGATCCTCACTCTCCTTGGTT 
G S S L S L V 



GAT CCT CACT CTCCTTGGTTT 
G S S L S L V 



TGGTTTCCAGCACATCGTCAG 
L V S S T S S 



GGTT TCC AGC AC AT CGT C AGT 
V S S T S S 



TCAACCAAACAGACAAGGAGA 
V N Q T D K E 

CAAC CAAAC AG ACAAGGAGAA 
M Q T D K E 

CCAAACAGACAAGGAGAAA 
N Q T D K E K 

AC CAAACAG AC AAGGAGAAAG 
K Q T D K E K 

C CAAAC AGACAAGGAGAAAGG 
Q T D K E K 

CAAACAGACAAGGAGAAAGGC 
Q T D K E K G 

AAACAGACAAGGAGAAAGGCA 
Q T D K E K G 

AACAGACAAGGAGAAAGGCAT 
T D K E K G 

AC AGACAAGG AGAAAGGC AT C 
T D K E K G I 

CAGACAAGGAGAAAGGCATCT 
T D K E K G I 

^ AC AAGG AG AAAGGCATCT C 
D K E K G I 

GACAAGGAG AAAGGCAT CT C A 
D K E K G I S 

AC AAGGAG AAAGGCAT CT CAT 
D K E K G I S 



ATCCTCACTCTCCTTGGTTTC 
S S L S L V 



TCCT CACT CTC CTTGGTTTCC 

s s h s h v s 



CCTCACTCTCCTTGGTTTCCA 
S S If S L V S 



CTCACTCTCCTTGGTTTCCAG 
S L S L V S 



TCACTCTCCTTGGTTT CCAGC 
S I> S I* V S S 



CACT CT CCTT GGT TTCCAGCA 
S L S L V S S 



ACT CT C CT T GGTTT C CAGCAC 
L S L V S S 



CTCTCCTTGGTTTCCAGCACA 
L S L V S S T 



TCT C CTTGGTTTCCAGCACAT 
L S I* V S S T 



CTCCTT GGTTT CCAGCACATC 
S I* V S S T 



TCCTTGGTTTCCAGCACATCG 
S L V S S T S 



CCTTGGTTTCCAGCACATCGT 
S L V S S T S 



CTTGGTTTCCAGCACATCGTC 
L V S S T S 



GTTTCCAGCACATCGTCAGTT 
V S S T S S V 



TTTCCAGCACATCGTCAGTTT 
V S S T S S V 



TT CCAGCACATCGT CAGTTT A 
S S T S S V 



TCC AGC AC AT CGT CAGTTT AT 

s s t s s v y 



CCAGCACATCGT CAGTTTATT 
S S T S S V Y 



CAGCACAT CGTCAGT TTATTC 
S T S S V Y 



AGC ACAT CGT CAGTTT ATT CT 
S T S S V Y S 



GCACAT CGT CAGTTT ATT CT A 
S T S S V Y S 



CACATCGT CAGTTT ATT CT AC 
T S S V Y S 



ACATCGTCAGTTTAT TCT ACA 
T S S V Y S T 



CATCGT CAGTTT ATT CTACAC 
T S S V Y S T 



AT CGT CAGTTT ATT CTACAC C 
S S V Y S T 



TCGT CAGTTT ATT CTACAC C A 
S S V" Y S T P 



CAAGGAGAAAGGCAT CT CAT C 
K E K G I S 



TTGGTTTCCAGCACATCGTCA 
L V S S T S S 



CGT CAGTTT AT T CT ACACCAG 
S S V Y S T P 



T3 



Fig. 20 (cont'd 5) 



GGAAGAACTGGGTCAATGAGTTACGCAGCTCC 
KNWVNELRSS 



AAGAAGAAGAGGAAGAACTGGGTCAATGAGTTACGCAGCTCCTTCAAG 103 / 124 

KKKRKNWVNELRS SFK 

T3 

TCTCTAATTTCAGAATGCATGGATA 
T3 

AGGAGATGAAGCTGACAGATATCCGCTTAGAAGCTCT 
T3 

GATTCCAGACCACACGTCTTTCTTATCG 



Fig. 20 (cont 'd 6) 
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Alignment of the T protein family 



f?¥¥5 



Note: The N-terminus of protein T2 was omitted in the alignment, since it 
has no significant homology to the T protein and the T3 protein. 



T - - -MDLSSEMNRHGKNPVSHKLEDQK KI YTDWANHYLAKSGHKRLIKDLQ 60 

T3 NQPERLNSQVLQGL.QEPAGEGLPLRKSGSVENGFDTQIYTDWANHYLAKSGHKRLIRDLQ 

T2 



* * 



^ * * *******************.*** 



T QDIADGVLLAEIIQIIANEKVEDINGCPRSQSQMIENVDVC^ 120 

T3 QDVTDGVLLAQIIQWANEKIEDINGCPK1JRSQMIENIDACLNFIJVAKGINIQGLSAEEI 

T2 

** ****** *** _**** ^ ******* . ******_* ** ****,*_*.******** 

T RNGNLKA I LGL F F S LS RYKQQQ - HHQQQ Y YQS LVELQQRVT 18 0 

T3 RNGNLKA I LGL FFSLSRYKQQQQQPQKQHLSS PL PPAVSQVAGAPSQCQAGTPQQQVPVT 

T2 



********************** . *.* * 



* * * 



T HASP PSEASQAKTQQDMQSRLPGP - SRVPAAGS S SKVQGASNLN - - RRSQS FNS I 240 

T3 pQAPCQPHQPAPHQQSKAQAEMQSRLPGPTARVST^GSEAKTRGGSTTANNRRSQSFNNY 

T2 

* * * * * ******** .** **** .* .* *. ******* 

T DKNK PP- 300 

T3 DKSKPVTSPPPPPSSHEKEPLASSASSHPGMSDNAPASLESGSSSTPTNCSTSSAIPQPG 

T2 

* * * * * 

T N 360 

T3 AATKPWRSKSLSVKHSATVSMLSVKPPGPEAPRPTPEAMKPAPNNQKSMLEKLKLFNSKG 

T2 

* 

T 420 

T3 GSKAGEGPGSRDTSCERLETLPSFEESEELEAASRMLTTVGPASSSPKIALKGIAQRTFS 

T2 " 



T --YAN GNEK 

T3 RALTNKKSSLKGNEKEKEKQQREKDKE^ 

T2 ~ 

* * * * * 



T3 PKKSSKIASFIPKGGKLNSAKKEPMAPSHSGIPKPGMKSMPGKSPSAPAPSKEGERSRSG 
T2 



T 600 

T3 KLSSGLPQQKPQLDGRHSSSSSSLASSEGKGPGGTTLNHSISSQTVSGSVGTTQTTGSNT 

T2 " 



Fig. 21 
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x *660 

T3 VSVQLPQPQQQYNHPNTATVAPFLYRSQTDTEGNVTAESSSTGVSVEPSHFTKTGQPALE 

T2 

T GEDPETRRMRTVKNIADLRQNLEETMSSLRGTQISHSTLETTFDSTVTTEVNGRTIP 72 0 

T3 ELTGEDPEARRLRTVKNIADLRQNLEETMSSLRGTQVTHSTLETTFDTNVTTEMSGRSIL 

x2 DPESQRKRTVQNVLDLRQNLEETMSSLRGSQVTHSSLEMTCYD--SDDANPRSVS 

***** * * * * ^ * ^ ***************,*_.**.** * . . *.. 

T NLTSRPTPMTWRLGQACPRIiQAGDAPSLGAGYP-RSGTSRFIHTDPSRFMYTTPLRRAAV 780 
T3 SLTGRPTPLSWRLGQSSPRLQAGDAPSMGNGYPPRANASRFINTESGRYVYSAPLRRQLA 
T2 SLSNRSYPLSWRYGQSSPRLQAGDAPSVGGSCRSEGTPAWYMHGERAHYSHTMPMR- - SP 
* * *_** ************_* *.* 

T SRLGNMSQIDMSEKA-SSDLDMS-SEVDVGGYMSDGDIIiGKSLRTDDINSGYMTDGGLNL 840 
T3 SRGS SVCHVDVSDKA - GDEMDLEGI SMDAPGYMSDGDVLS KNI RTDD I TSGYMTDGGLGL 

T2 SKLSHISRLELVESLDSDEVDLK SGYMSDSDLMGKTMTEDDD ITTG 

^ * ********* * * 



T YTRSLNRIPD-TATSRDIIQRGVHDVTVDADSWDDSSSVSSGLSDTLDNISTDDLNTTSS 900 
T3 YTRRLNRLPDGMAWRETLQRNTSLGLGDAD SWDDS S SVS SGI SDTI DNLSTDD INTS S S 

T2 WDESSSISSGLSDASDNLSSEEFNASSS 

** *** ★**,*★, * * * _ *_** 

T VSSYSNITVPSRKN — TQLRTDSEKRSTTDET- -WDSP- -EELKKPE- -EDFDSHGDAG- 960 

T3 ISSYANTPASSRKN- -LDVQTDAEKHSQVERNSLWSG DDVKKSDGGSDSGIKMEPG- 

T2 LNSLPSTPTASRRNSTIVLRTDSEKRSLAESGLSWFSESEEKAPKKLEYDSGSLKMEPGT 
* **.* _** **.*. * . * . . * 

T GKWKTVSSGLPEDPEK-AGQKASLSVSQTGSWRRGMSAQGGAPS RQKAGTSALKTP - 1020 

T3 SKWRRNPSDVSDESDKSTSGKKNPVISQTGSWRRGMTAQVGITMPRTKASAPAGALKTPG 
T2 S KWRRE R PESCDDSS KGGE L KKP I S LGH PG S L KKGKT P P VAVT S P - - I THTAQS ALKVAG 
** * * ** ..* . - *** 

T -GKTDDAKASEKGKAPLKGSSLQRSPSDAGKSSGDEGKK- -PPSGIGRSTATSSFGFKKP 108 0 
T3 TG KTDDAKV S E KGRL S P KAS Q VKRS P S DAGR S S GDE S KKP L P S S S RT PT ANANS FGF KKQ 

T2 KPEGKATDKGKLAVKNTGLQRSSSDAGRDRLSDAKK- - PPSGIARPSTSGSFGYKKP 

* ** * , ..** ****. . ** * * . ***,** 

T SG - VGS SAM I TS SGAT ITSGSATLGKI PKS AAIGGKSNAGRKTSLDGSQNQDDVVLHVS S 1140 
T3 SGSATGLAMITASGVTVTSRSATLGKIPKSSALVSRS-AGRKSSMDGAQNQDDGYLALSS 

T2 PP-ATGTATVMQTG GSATLSKIQKSSGIPVKPVNGRKTSLDVSNSAEPGFLAPGA 

* * ********.. . *****_ , * 

T KTTLQYRSLPRPSKSSTSGIPGR-GGHRSSTSSID- SNVSSKSAGATTSKLREPTKIGSG 12 00 
T3 RTNLQYRSLPRPSKSNSR- -NG- -AGNRSSTSSID- SNISSKSAGLPVPKLREPSKTALG 
T2 RSNIQYRSLPRPAKSSSMSVTGGRGGPRPVSSSIDPSLLSTKQGGLTPSRLKEPTKVASG 
_._********.★* . * * * .**** * * _*,**,* * 

T RS S PVTVNQTDKEKEKVAVSDSESVSLSG - S PKSS PTSASACG - AQGLRQPGSKYPDI AS 12 60 
T3 S S Li PGL VNQTD KE KG - -IS SDNESVAS CN - S VKVNPAAQPVS S P AQTS LQPGAKY PDVAS 

T2 RTTPAPVNQTDREKE KAKAKAVALDSDNI SLKS IGSPESTPKNQASHPTATKLAELP 

******** . . * . . . 

T PTFRRLFGAKAGGKSASAPNTEGVKSSSVMPSPSTTLARQGSLESPSSGTGSMGSAC^LS 13 2 0 
T3 PTIjRRLFGGKP - TKQVPI ATAENMKNS WI SNPHATMTQQGNLDS PS - GSGVLS - - — -S 

T2 PTPLRAT- AKSFVKPPSIiANLDKVN-SNSLDLPSSSDTTHASKVPDLHATSSAS 

*** * * • -" 



Fin 91 ( rnnt 1 d 1) 
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T GSSSPLFNKPSDLTTDVISLSHSLASSPASVHSFTSGGLVWAANMSSSSAGSKDTPSYQS 13 8 0 

T 3 GS S S PL YS KNVDLN QS PLAS S PS SAHSAPSNS LTWGTNAS S S SAVS KDGLGFQS 

T2 --GGPLPS CFTPSPAPILNINSASFSQGLELMSGFSVPKETRMYPK 

* * * * * * _ * 

T MTSLHTSSESIDLPLS HHGSLSGLTTG THEVQSLLMRTGSVRSTLSES - - 1440 

T3 VSSLHTSCESIDISLrSSGGVPSHNSSTGLIASS KDDSLTPFVRTNSVKTTLSESPL 

T2 LSGLHRSMESLQMPMSLPSAFPSSTPVPTPPAPPAAPTEEETEELTWSGSPRAGQLDS - - 
***** * ^ t . . * . . . * 

T MQLDRNTLPKKGLRYTPSSRQANQEEGKEWLRSHSTGGL 150 0 

T3 S S PAAS PKFCRS TLPRKQDSD PHLDRNTL PKKGLRYT PTS QLRTQEDAKEWLRSHS AGGL 

T2 NQRDRNTLPKKGLRYQLQS QEETKERRHSHTlGGL 

************ * **** ***** 

T QDTGNQS PLVS PSAMS S SAAGKYHFSNLVS PTNLSQF3STLPGPSMMRSNS I PAQDS S FDL Y 1560 
T3 QDTAANSPFSSGSSVTSPSGTRFNFSQLASPTTVTQMSLSNPTMLRTHSLSNADGQYDPY 

T2 PESDDQSELPSPPALPMSLSAKGQLTNIVSPTAAT TPRITRSNSIPTHEAAFELY 

. * * .. - ...*** . *.*..*. . 

T DDSQLCGSATSLEERPRAISHSGSFRDSMEEVHGSSLSLVSSTSSLYSTAEEKAHSEQIH 162 0 
T3 TDSRFRNSSMSLDEKSRTMSRSGSFRDGFEEVHGSSLSLVSSTSSVYSTPEEKCQSE-IR 
T2 SGSQMG-STLSIiAERPKGMIRSGSFRDPTDDVHGSVLSLASSASSTYSSAEERMQSEQIR 

* * ** *_ ****** t * * * * *** **_** * * ^ **. _** * ^ 

T KXjRRELVASQEK^ATLTSQLSANAHLVAAFEKSLGlSnyiTGRLQSLTMTAEQKESEL 168 0 

T3 KLRRELDASQEK?/SALTTQLTANAHLVAAFEQSLGlSn^TIRLQSLTMTAEQKDSELNEIiRK 
T2 KLRRELESSQEK^ATLTSQLSANANLVAAFEQSLVl^TSRLRHLAETAEEKDTELLDLRE 
****** ******************** *** * * * ****** ** 

T T I EMLKAQNS AAQAA I QGALNGPDH P PK DLRIRRQHSSESVSSINSATSHSS 174 0 

T3 TIELLKKQNAAAQAAINGVINTPELNCKGNGTAQSADLRIRRQHSSDSVSSINSATSHSS 

T2 T I D F LKKKNS EAQAV I QGALNAS E TT PK ELRIKRQNSSDSISSLNSITSHSS 

** ** * *** ** * m * ************ ***** 

T I G S GNDAD S KKKKKKNWL RSSFKQAFGKKKSTKPPSSHSDIEELT- -DSSLPASPKL 1800 

T3 VGSNIESDSKKKKRKfcrm^ELRSSFKG^ 

T2 IGSSKDADAKKKKKKSWL. RSSFNKAFSIKKGPKSASSYSDIEEIATPDSSAPSSPKL 

** ****** * ^ **** * * ** * ** *****^ *** ***** 

T PHNAGDCGSASMKPS QS ASA I CECTEA EAE 1 1 IiQLKSELRE 1860 

T3 PHNGSTGSTPIiLRNSHSNSLISECMDS EAETVMQLRNELRD 

T2 QHGSTETASPSIKSSTSSSVGTDVTEGPAHPAPHTRLFHANEEEEPEKKEVSELRSELWE 
* * ..*.**. 

T KELKLTD I RLEAL S S AHHLDQ I REAMNRMQNE I E I LKAENDRLKAETGNTAKPTRP P S E S 1920 
T3 KEMKLTDIRLEALSSAHQLDQLREAMNRMQSEIEKLKAENDRLKSES-QGSGCSRAPSQV 

T2 KEMKLTDIRLEALNSAHQLDQLRETMHNMQIjEVDLLEAElsrDRLKVAP- -GPSSGSTPGQV 
************ *********^ *** * ******* * 

T SSSTSSSSSRQSLGLSLNNLNITEAVSSDILLDDAGDATGHKDG-RSVKIIVSISKGYGR 198 0 

T3 SISAS PRQSMGLSQHSLNLTESTSL.DMLLDDTGECSARKEGGRHVKIWSFQEEMKW 

T2 PGSSAIjSSPRRSLGIjALTHSFGPSIjADTDLSPMDGISTCGPKEE- VTLRVVVRMPPQHII 

* * * ^ * * ^ * _ * * _ . . . . * 
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T AKDQKSQAYIjIGSIGVSGKTKWDVLDGVIRRLFKEYVFRIDTSTSLGLiSSDCIASYCIGD 2 040 
T3 KEDSRPHL.FLIGCIGVSGKTKWDVLDGWRRLFKEYIIHVDPVSQLGLNSDSVLGYSIGE 
T2 KGDLKQQEFFLGCSKVSGKVDWKMLDEAVFQVFKDYISKMDPASTLGLSTESIHGYSISH 

* * * * * * * , * * . ..**.*. . . * . ** * .... * . * 

T LIRSHNLEVPELLPCGYLVGDNNIITVNLKGVEENSLDSFVFDTLIPKPITQRYFNLLME 2100 
T3 IKRSNTSETPELLPCGYLVGENTTISVTVKGLAENSLDSLVFESLIPKPILQRYVSLLIE 
T2 VKRVLDAEPPEMPPCRRGVNN ISVSLKGLKEKCVDSLVFETLIPKPMMQHYISLLLK 

* * ** ** * . *********..*****.*.* ** . 

T HHR 1 I LSGPSGTGKTYLANKLAE YV I TKSGRKKTEDAI ATFNVDHKS S KE LQQYLANLAE 2160 
T3 HRRI I LSGPSGTGKTYLANRLSEY I VLREGRELTDGVI ATFNVDHKS SKELRQYLSNLAD 
T2 HRRLVLSGPSGTGKTYLTNRLAEYLVERSGREVTEGIVSTFNMHQQSCKDLQLYLSNLAN 

* *_************.*.*.**.. . * * * . ..***. ..*.*.*. **.**** 

T QCSADNNGVELPWIILDNLHHVGSLSDIFNGFLNCKYNKCPYIIGTMNQGVSSSPNLEL 2220 
T3 QCNSENNAVDMPLVIILDNLHHVSSLGEIFNGLLNCKYHKCPYIIGTMNQATSSTPNLQL 
T2 QIDRETGIGDVPLVILLDDLSEAGSISELVNGALTCKYHKCPYIIGTTNQPVKMTPNHGF 
^ * * ** * * ^ ** * *** ******** *★ ** 

T HHNFRWVLCANHTE PVKGF LGRYLRRKL I E I E T ERN I RNNDLVKI I DW I P KTWHHLNS FL 228 0 
T3 HHNFRWVLCANHTE PVKGFLGRFLRRKLMETEISGRVRNMELVKI IDWI PKVWHHLNRFL 
T2 HLSFRMLTFSNNVEPANGFLVRYLRRKLVESDSDINANKEELLRVLDWVPKLWYHLHTFL 

* ** * ** *** *_***★*.* . .*....**.** * ** - ** 

T ETHSSSDVTIGPRLFLPCPMDVEGSRVWF1VIDLWNYSLVPYILEAVREGLQMYGKRTPWED 2340 
T3 EAHSSSDVTIGPRLFLSCPIDVDGSRVWFTDLWNYSIIPYLLEAVREGLQLYGRRAPWED 
T2 EKHSTSDFLIGPCFFLSCPIGIEDFRTWFIDLWNNSIIPYLQEGAKDGIKVHGQKAAWED 

* **.** *** ** **. * ** **** *..**. * ..*... *... *** 

T psKWVLDTYPWSSATLPQESPALLQLRPEDVGYESCTSTKEATTSKHIPQTDTEGDPLMN 2400 
T3 PAKWVMDTYPWAASPQQHEWPPLLQLRPEDVGFDGYSMPREGSTSKQMPPSDAEGDPLMN 
T2 PVEWVRDTLPWPSAQQDQSK- -LYHLPPPTVGPHSIASPPEDRTVKDSTPSSLDSDPLMA 
*******. . *,**** . * * * . **** 



T MLMKLQEAANYSSTQSCDSES- -TSHHEDILDSSLESTL 
T3 MLMRLQEAANYS S PQS YDSDSNSNSHHDD I LDS S LESTL 

T2 MLLKLQEAANYIE - - SPDRET ILDPNLQATL 

**_******* * * .. *** *_** 
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Alignment of the T protein with the POM121 protein 
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T-Protein 
P0M121 



^LSSEMNRHGKfcTPVSHKLEDQKKIYTDWAl^^ 



T*Protein QIOJtflEKVEDINGCPRSQSQMIE^ 

POM12 1 MS P AAAAADGGERRRP PLGVREGRGR - TOGCGGPAG AAAL GIiAL LGLAL 



T-Protein 
P0M121 



SLSRYKQQQHHQQQYYQSLVELQQRVTHASPPSEASQAKTQQDMQSRLPGPSRVPAAGSS 
YLV P- - -AAAALAVTLAVGASAAWWGIiSREPRGP - - 



T-Protein 
tfl21 



SKVQGASNIiNRRSQSFNSIDKNKPP^ 

RGIiSSFVRBSR RHPRPALTASPLPAKSP VNGSLCBPRS 



* * * 



T-Protein 
POM121 



SIxRGTQISHSTLETTPDSTVTTEVNGRTIPWIjTSRPTPMTWRLGQACPRLQAGDAPSLGA 
plggpdpaelllmgsylg KPGPPEPALPQD-PRDRPGRRPPSRS 



T-Protein GYPRSGTSRPIHTDPSRFMYTTPLRRAAVSRIiGNMSQIDMSEKASSDLDMSSEVDVGG^ 
POM121 PPSSSTAQRVHHVYP ALPTPLKRPSRR PPHRDCGPLS 



* * * * 



** * * 



T-Protein 
POM121 



' Protein 
POM121 



T-Protein 
POM121 



T-Protein 
POM121 



S DGD ILGKS LRTDDINSGYMTDGGLNliYTRS LNRI PDTATS RD 1 1 QRGVHDVTVDADS WD 
SRFVI TPR - RRYP I QQAQYSLLGALPTVCWNGGHKKAVLSARNS - RMVCS PVTVRI APPD 



D S S SVS S GL SDTLDN I STDDLNTTSS VS S YSNITVP SRKNTQLRTDSE KRS TTDETWD S P 
SKLFRSPMPEQILSTTLSSPSSITAPDPCAKETV^ 



■t * * * * * 



EELKKPEKDFDSHGDAGGKWKTVS S GLPEDPEKAGQKAS L SVS QTGS WRRGMS AQGGAP S 
LHUDGQENKRRRHDSSG SGHSAFEPLVANGVPAAFVPKPGSLKRSLASQSSDDH 



* * 



RQK-AGTSAIiKTPGKXDDAKASEI<GK:-APLKGSSLQRSPSDAGKSSGDEGKKPPSGIGRS 
LNKRSRTSSVSSLTSTCTGGIPSSSRNAITSSYSSTRGVSQLWKRSG-PTSSPFSSPASS 



-A- * * * * * * * 



T-Protein TATSSFGPKKPSGVGSSAMITSSGATITSGSATIjGKIPKSAAIGG^NAGRKTSLDGSQN 
POM121 RSQTPERPAKKTREEEPCHQSSSSAPIjVTPKESPGEKVTDPATGKQQSLWSPPTPGSSG 



T-Protein QDDVVLHVSSK7ITLQYRSLPRPSKSSTSGIPGRG 

POM 12 1 QRKRKlQIiLPSRRGDQLTLPPP P- -ELG- - YSITAEDLDMERR AS - - »LQ 



Fig. 22 



T- Protein 
POM121 



1145 
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EPTKIGSGRSSPVTVNQTDKEKEKVAVSDSESVSI1SGSPKSSPTSASACGAQGI1RQPGSK 

WFNKVLEDKTDDASTPATDTSP ATSPPFTLTL P TVGPAASPASLPAPSS - 

fr ** * * t . . * * * * _ * * * 



T-Protein YPDXASPTFRRI/FGAKAGGKSASAPNTEGVKSSSWPSPSTTLARQGSL^ 

POM121 OTLIiESliKKMQESPAPSSSEPPE--AATVAAPSPPKTPSIjI J API J VSP 



*** * 



T-Procein SAGGLSGSSSPIiFNK^SDLTTDVISLSHSLASSPASVHSFTSGGLWJU!J^SSSSAGSro 
POM121 I/TG PLASTSSDSKPTTTFLGLASASSATPLTDTICAPGVSQAQLCVSTPAATAP 



it * * * ** 



* * * * 



T - Procein TPSYQSOTSLHTSSESIDLPLSmGSLSGLTTGTHEVQSLLMRTGSWSTLSESMQI^RN 
POM121 SP TPASTLFGMLSPPA5SSSLATPGPACASPMFKPIFPATPKSE SDN 



** * * * 



T-Protein TLPKKGLRYTP S S RQANQEEGKEWLRSHSTGGIjQDTGNQS PLVS P SAMS S S AAGKYHFSN 
DM121 PLP TS S S AATTTP AS TALPTTATAT AHTFKP I FES VE P FAAMP 



T-Protein LVSPTNLSQFmPGPSM^SWSIPAQDSSPDLYDDSQLCGSATSLEERPRAISHSGSFRB 

POM12 1 LSPPFSLKQTTAPATTAATSAPLLTG L GTATST VATGTTAS 

***** i- * *** . * . 



T-PrOtein SMEEVHGSSLSL VS S TS SLYS TAEEKAHSEQ IHKLRRELVAS QEKVATLTS QLS ANAHLV 
POM 12 1 ASKPVFGFGVTTAASTASTIAS TSQS I LFGGAPPVTAS S S APALAS IFQFGKPLA 



T-Protein AAFEKSIiGNMTGRLQSLTMTAEQKESELIEIiRETIEMLK^ 

POM12 1 PAASVAGTSFSQSLASSAQTAASNSS - -GGFSGFGGTIiTTSTSAPATTSQPTLTFSNTVT 



* * ** 



- Protein KDLRIRRQHSSE * S VS S IHSATSHS S IGSGHDAJ3SKKKKIOCbTMljRSSFKQAFGKKKSTK- 
KM12 1 PTFNIPFSASAKPALPTYPGANSQPTFG- ATDGATKP ALAPSFGSSFTFGNS VAS 



T-Protein PPSSHSDIEELTDSSfcPASPKLPHlIAGDCGSAS 

POM121 ... APSAAPAPAAFGGAAQPAFGGLKASASTFG TPASTQPAFGSTTS VFSFGSA 



r* * * * 



T-Protein 
POM121 



LREKELKLTD IRLEALS S AHHLDQIREAMNRKQNE IEI LKAENDRLKAETGNTAKPTRPP 
TTS GFGAAAATTQTTHSGS SSSLFGSSTPS - PF 



T-Protein 
POM121 



SESSSSTSSSSSRQSLGLSLNNI^ITEAVSSDiriLDDAGDA 

TFGGSAAPAGGG GFGLSATPGTGSTSGTFSFGSGQSGT TGTTTSFGGSLSQNT 



* * * * 



T-Protein grakdqksqayligsigvsgktk^vldgvii^ 

POM12 1 LGAPSQSS - - PFAFSVGSTPESKP -VFGGTSTPTFGQSAPAPG V 



* 4- * 



Fig. 22 (cont'd 1) 



T- Protein 
POM121 



T-Protein 
POM121 



T-Protein 

POM121 



T-Protein 
POM121 



T-Protein 



T-Protein 
POM121 



T-Protein 
P0M121 
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GDLIRSHNIiEVPELLPCGYIjVGDNNI ITVNLKGVEENSLDS FVFDTLIPKPITQRYFNLIj 

GTTGS SLS FGAP S TPAQGFVG VGPFGSGAPSPSIGAGSKTPGARQRLQAR 

* x * + # * * * *.... 

MEHHRIILSGPSGTGKTYLANKIiAE^ 

RQHTRKK 

AE QCS ADUNGVELP WI ILDNIiHHVGSIiSD I FNGFLNCKYNKCP YI IGTMNQGVS S S PJSHL 

ET i H H N ir'K WVLCAKHTEP VKGFLGRYLRRKLI E I EI ERN I RNNDLVKT IDWI PKTWHHIiETS 

FI^THSSSDVTIGPRLFLPCPMDVEGSRV^MDLWNYSLVPYIIjEAVREGLQMYGKRTPW 

EDPSKWVIiDTYPWSSATLPQESPALLQLRPED 

MNMLMKLQEAANYS S TQS CDSESTSHHED XLDS SLESTL 



Fig. 22 (cont'd 2) 
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Expression of the T gene family. 

a fetal tissue: left : T gene; middle: T2 gene; right : T3 gene. 

He = heart; Br = brain; Lu = lungs; Li = liver; Ki = kidney 
b adult tissue: left : T gene; middle : T2 gene; right : T3 gene. 

He = heart; Br = brain; PI = placenta; Lu = lungs; Li = liver; Mu = 

skeletal muscle; Ki = kidney; Pa = pancreas 
c adult brain regions: left : T gene; middle : T2 gene; right : T3 gene. 

Cer = cerebellum; Cor = cerebral cortex; Med = medulla; Sco = spinal 

cord; Opo = occipital pole; Flo = frontal lobe; Tlo = temporal lobe; 

Put = putamen S 

Fig. 23 
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Figure legend of immunohisto and electron microscopy: 

a = brain stem. CG central grey = central grey of the brain stem 

b = hippocampus, dg = dental gyrus; CA3 cornu ammonis 3, both subregions 
of the hippocampus formation 

c = electronmicroscopic picture. N = nucleus, He heterochromatii> — 
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r . PATENT APPLICATION ' - 

As a below named inventor, I hereby declare that: 

My residence/post office address and citizenship are as stated below next to my name; 

I believe I am the original, first and sole inventor (if only one name is listed below) or an original, first and joint inventor (if plura 
names are listed below) of the subject matter, which is claimed and for which a patent is sought on the invention entitled: 
PROTEIN (TP) THAT IS INVOLVED IN THE DEVELOPMENT OF THE NERVOUS SYSTEM 
the specification of which is attached hereto unless the following box is checked: 

(X) was filed on August 24, 2001 as US Application Serial No. o9/914,549 or PCT International Application 

Number and was amended on (if applicable). 

I hereby state that I have reviewed and understood the contents of the above-identified specification, including the claims, as amende 
by any amendment(s) referred to above. I acknowledge the duty to disclose all information which is material to patentability as define 
in 37 CFR 1.56. 

Foreign Applications) and/or Claim of Foreign Priority 

I hereby claim foreign priority benefits under Title 35, United States Code Section 1 19(a-d) or 365(b) of any foreign application(s) for patent or inventor(s) certificate, or 365(a 
of any PCT international application which designated at least one country other than the United States of America, listed below and have also identified below any foreig 
application for patent or inventor(s) certificate having a filing date before that of the application on which priority is claimed: 



COUNTRY 


APPLICATION NUMBER 


DATE FILED 


PRIORITY CLAIMED UNDER 35 U.S.C. 1 19 


Germany 


199 08 423.8 


26 February 1999 


YES: X NO: 


PCT 


PCT/DEOO/00583 


28 February 2000 


YES: X NO: 



Provisional Application 

I hereby claim the benefit under Title 35, United States Code Section 1 19(e) of any United States provisional applications) listed below: 



U.S. Priority Claim 

I hereby claim the benefit under Title 35, United States Code, Section 120 of any United States applications) listed below and, insofar as the subject matter of each of the claim 
of this application is not disclosed in the prior United States application in the manner provided by the first paragraph of Title 35, United States Code Section 1 12, 1 acknowledg 
the duty to disclose material information as defined in Title 37, Code of Federal Regulations, Section 1.56(a) which occurred between the filing date of the prior application an 
the national or PCT international filing date of this application: 



APPLICATION SERIAL NUMBER 


FILING DATE 


S T ATUS(patented/pending/abandoned) 





















POWER OF ATTORNEY: 

As a named inventor, I hereby appoint the following attomey(s) and/or agent(s) listed below to prosecute this application and transact all business in the Patent and Trademar 
Office connected therewith. 



Steven J. Hultquist, Reg. No. 28,021 Marianne Fuierer, Reg. No. 39,983 
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I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information and belief are believed to be true; and further that thes 
statements were made with the knowledge that willful false statements and the like so made are punishable by fine or imprisonment, or both, under Section 1001 of Title 1 8 of th 
United States Code and tha/such willful false statements may jeopardize the validity of the application or any patent issued thereon. 

v^torcAroiemarie Poustka /KTXx / Citizenship 



Full Name of InW 



Residence: Y&tt*»sfrmmG-Z6, p 69120 Heidelberg^ German] 



Post Office Address: 
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Post Office Address: Same 
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SEQUENCE LISTING 

<110> Poustka, et al . 

<120> Protein (TP) That is Involved in the Development of the Nervous 
System 

<130> 4121-129 

<140> 

<141> 

<150> PCT/DE00/00583 

DE 199 08 423.8 
<151> 2000-02-28 
1999-02-26 

<160> 39 

<170> Patentln Ver . 2.0 



<210> 1 

<211> 242 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<223> (3) . . (242) 



<400> 1 



aaa 

Lys 



47 



95 



ag cga gtt act cac get tec cct cca teg gaa gee age cag gec aaa 
Arg Val Thr His Ala Ser Pro Pro Ser Glu Ala Ser Gin Ala Lys 
X 5 10 15 

acc cag caa gat atg cag tec agt ctg gca gee aga tat gca act cag 
Thr Gin Gin Asp Met Gin Ser Ser Leu Ala Ala Arg Tyr Ala Thr Gin 
20 25 30 

tct aat cac agt gga att gca acc agt caa aaa aag cct act agg ctt 143 
Ser Asn His Ser Gly lie Ala Thr Ser Gin Lys Lys Pro Thr Arg Leu 
35 40 45 

cca ggg ccc tct agg gtg cct get gca gga age age age aag gtc cag 191 
Pro Gly Pro Ser Arg Val Pro Ala Ala Gly Ser Ser Ser Lys Val Gin 
50 55 60 

gga gec tct aat tta aat agg aga agt cag age ttt aac age att gac 239 
Gly Ala Ser Asn Leu Asn Arg Arg Ser Gin Ser Phe Asn Ser lie Asp 
65 70 75 



242 



1 
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80 



<210> 2 
<211> 80 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Arg Val Thr His Ala Ser Pro Pro Ser Glu Ala Ser Gin Ala Lys Thr 
15 10 15 



Gin Gin Asp Met Gin Ser Ser Leu Ala Ala Arg Tyr Ala Thr Gin Ser 
20 25 30 

Asn His Ser Gly He Ala Thr Ser Gin Lys Lys Pro Thr Arg Leu Pro 
35 40 45 

Gly Pro Ser Arg Val Pro Ala Ala Gly Ser Ser Ser Lys Val Gin Gly 
50 55 60 

Ala Ser Asn Leu Asn Arg Arg Ser Gin Ser Phe Asn Ser He Asp Lys 
65 70 75 80 



<210> 3 

<211> 159 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<223> (1) - . (159) 



<400> 3 



ggc act cac gag gtc cag age ctg etc atg aga acg ggt agt gtg aga 
Gly Thr His Glu Val Gin Ser Leu Leu Met Arg Thr Gly Ser Val Arg 
5 10 15 

tct act etc tea gaa aga tat ace cca tea tct egg cag gee aac caa 
Ser Thr Leu Ser Glu Arg Tyr Thr Pro Ser Ser Arg Gin Ala Asn Gin 
20 25 30 

gaa gag ggc aaa gag tgg ttg cgt tct cat tct act gga ggg ctt cag 
Glu Glu Gly Lys Glu Trp Leu Arg Ser His Ser Thr Gly Gly Leu Gin 
35 40 45 

gac act ggc aac cag 
Asp Thr Gly Asn Gin 
5 0 



48 



96 



144 



159 



2 
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<210> 4 
<211> 53 
<212> PRT 

<213> Homo sapiens 
<400> 4 



Gly Thr His Glu Val Gin Ser Leu Leu Met Arg Thr Gly Ser Val Arg 
15 10 15 

Ser Thr Leu Ser Glu Arg Tyr Thr Pro Ser Ser Arg Gin Ala Asn Gin 
20 25 30 

Glu Glu Gly Lys Glu Trp Leu Arg Ser His Ser Thr Gly Gly Leu Gin 
35 40 45 

Asp Thr Gly Asn Gin 
50 



<210> 5 

<211> 2461 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<223> (2) . . (1627) 



<400> 5 

g gat cag ctt egg gag acc atg cac aac atg cag ttg gag gtg gac 
Asp Gin Leu Arg Glu Thr Met His Asn Met Gin Leu Glu Val Asp 

ctg ctg aaa gca gag aat gac cga ctg aag gta gec cca ggc ccc tea 
Leu Leu Lys Ala Glu Asn Asp Arg Leu Lys Val Ala Pro Gly Pro Ser 

tea ggc tec act cca ggg cag gtc cct gga tea tct gca tta tct tec 
Ser Gly Ser Thr Pro Gly Gin Val Pro Gly Ser Ser Ala Leu Ser Ser 

cca cgc cgc tec eta ggc ctg gca etc acc cat tec ttc ggc ccc agt 
Pro Arg Arg Ser Leu Gly Leu Ala Leu Thr His Ser Phe Gly Pro Ser 

ctt gca gac aca gac ctg tea ccc atg gat ggc ate agt act tgt ggt 
Leu Ala Asp Thr Asp Leu Ser Pro Met Asp Gly He Ser Thr Cys Gly 

cca aag gag gaa gtg acc etc egg gtg gtg gtg agg atg ccc ccg cag 
Pro Lys Glu Glu Val Thr Leu Arg Val Val Val Arg Met Pro Pro Gin 



46 



94 



142 



190 



238 



286 
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cac ate ate aaa ggg gac ttg aag cag cag gaa ttc ttc ctg ggc tgt 334 
His lie lie Lys Gly Asp Leu Lys Gin Gin Glu Phe Phe Leu Gly Cys 



age aag gtc agt gga aaa gtt gac tgg aag atg ctg gat gaa get gtt 
Ser Lys Val Ser Gly Lys Val Asp Trp Lys Met Leu Asp Glu Ala Val 

ttc caa gtg ttc aag gac tat att tct aaa atg gac cca gee tct ace 
Phe Gin Val Phe Lys Asp Tyr He Ser Lys Met Asp Pro Ala Ser Thr 

ctg gga eta age act gag tec ate cat ggc tac age ate age cac gtg 
Leu Gly Leu Ser Thr Glu Ser He His Gly Tyr Ser He Ser His Val 



gtc gac age ctg gtg ttc gag acg ctg ate ccc aag ccg atg atg cag 
Val Asp Ser Leu Val Phe Glu Thr Leu He Pro Lys Pro Met Met Gin 

cac tac ata age etc ctg ctg aag cac egg cgc etc gtc etc teg ggc 
His Tyr He Ser Leu Leu Leu Lys His Arg Arg Leu Val Leu Ser Gly 



382 



430 



478 



aaa cga gtg ttg gat gca gag ccc ccc gag atg cct cct tgc cgt cga 526 
Lys Arg Val Leu Asp Ala Glu Pro Pro Glu Met Pro Pro Cys Arg Arg 

ggt gtc aat aac ata tea gtc tec etc aaa ggt ctg aag gag aaa tgc 574 
Gly Val Asn Asn He Ser Val Ser Leu Lys Gly Leu Lys Glu Lys Cys 



622 



670 



ccc age ggc acg ggc aag ace tac ctg ace aat cgc ttg gec gag tac 718 
Pro Ser Gly Thr Gly Lys Thr Tyr Leu Thr Asn Arg Leu Ala Glu Tyr 



ctg gtg gag cgc tct ggc cgt gag gtc aca gag ggc ate gtc age ace 
Leu Val Glu Arg Ser Gly Arg Glu Val Thr Glu Gly He Val Ser Thr 

ttc aac atg cac cag cag tct tgc aag gat ctg caa ctg tat ctt tec 
Phe Asn Met His Gin Gin Ser Cys Lys Asp Leu Gin Leu Tyr Leu Ser 

aac eta gee aac cag ata gac egg gaa aca gga att ggg gat gtg ccc 
Asn Leu Ala Asn Gin He Asp Arg Glu Thr Gly He Gly Asp Val Pro 

ctg gtg att eta ttg gat gac ctg agt gaa gca ggc tec ate agt gag 
Leu Val He Leu Leu Asp Asp Leu Ser Glu Ala Gly Ser He Ser Glu 

ttg gtc aat ggg gee etc acc tgc aag tat cat aaa tgt ccc tat att 
Leu Val Asn Gly Ala Leu Thr Cys Lys Tyr His Lys Cys Pro Tyr He 



766 



814 



862 



910 



958 
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ata ggt acc acc aat cag 
lie Gly Thr Thr Asn Gin 



cac ttg age ttc agg atg 
His Leu Ser Phe Arg Met 



aat ggc ttc ctg gtt cgt 
Asn Gly Phe Leu Val Arg 



age gac ate aat gec aac 
Ser Asp lie Asn Ala Asn 



gta ccc aag ctg tgg tat 
Val Pro Lys Leu Trp Tyr 



acc tea gac ttc etc ate 
Thr Ser Asp Phe Leu lie 



ggc att gag gac ttc egg 
Gly lie Glu Asp Phe Arg 



ate att ccc tat eta cag 
He He Pro Tyr Leu Gin 



gga cag aaa get get tgg 
Gly Gin Lys Ala Ala Trp 



ctt ccc tgg cca tea gee 
Leu Pro Trp Pro Ser Ala 



ccc cca ccc acc gtg ggc 
Pro Pro Pro Thr Val Gly 



agg aca gtc aaa gac age 
Arg Thr Val Lys Asp Ser 



atg gee atg ctg ctg aaa 
Met Ala Met Leu Leu Lys 



cca gat cga gaa acc ate 
Pro Asp Arg Glu Thr He 



cct gta aaa atg aca ccc 
Pro Val Lys Met Thr Pro 



ttg acc ttc tec aac aac 
Leu Thr Phe Ser Asn Asn 



tac ctg agg agg aag ctg 
Tyr Leu Arg Arg Lys Leu 



aag gaa gag ctg ctt egg 
Lys Glu Glu Leu Leu Arg 



cat etc cac acc ttc ctt 
His Leu His Thr Phe Leu 



ggc cct tgc ttc ttt ctg 
Gly Pro Cys Phe Phe Leu 



acc tgg ttc att gac ctg 
Thr Trp Phe lie Asp Leu 



gaa gga gec aag gat ggg 
Glu Gly Ala Lys Asp Gly 



gag gac cca gtg gaa tgg 
Glu Asp Pro Val Glu Trp 



caa caa gac caa tea aag 
Gin Gin Asp Gin Ser Lys 



cct cac age att gee tea 
Pro His Ser He Ala Ser 



acc cca agt tct ctg gac 
Thr Pro Ser Ser Leu Asp 



ctt caa gaa get gee aac 
Leu Gin Glu Ala Ala Asn 



ctg gac ccc aac ctt cag 
Leu Asp Pro Asn Leu Gin 



aac cat ggc ttg 1006 
Asn His Gly Leu 



gtg gag cca gee 1054 
Val Glu Pro Ala 



gta gag tea gac 1102 
Val Glu Ser Asp 



gtg etc gac tgg 1150 
Val Leu Asp Trp 



gag aag cac age 1198 
Glu Lys His Ser 



teg tgt ccc att 1246 
Ser Cys Pro He 



tgg aac aac tct 12 94 

Trp Asn Asn Ser 



ata aag gtc cat 1342 
He Lys Val His 



gtc egg gac aca 13 9 0 

Val Arg Asp Thr 



ctg tac cac ctg 1438 
Leu Tyr His Leu 



cct ccc gag gat 1486 
Pro Pro Glu Asp 



tea gat cct ctg 1534 
Ser Asp Pro Leu 



tac att gag tct 1582 
Tyr He Glu Ser 



gca aca ctt 1627 
Ala Thr Leu 
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taagggttcg gcaatcactg tcacccccgg acagcagaac gctggcatca gctatcttag 1687 

ctcctcctct cccctctcct ctttcagagc actggctctc cagccccagg aggagaacag 1747 

gagggaggag gagatgaaag aggagggaca ggttcttggt gctgtacctt tgagaacttc 1807 

ctaggaagga atggtggggt ggcgtttggg aacttgtgcc ccctaaacac atttactggc 1867 

ctcctctaat gactttgggg aaaagatgat tctgggtctt tcccttgact tcttgtttca 1927 

attacaaact cctgggcttt ctggggaggg gttcagaaaa catcaaaaca ctgcagcagt 1987 

tcctaaatga ttctcacaag caaccctgag agagacagtc ttgtgaggga gatctggggg 2 047 

aggcaggaag ctcctcagat tttctcacag acccttccca attccatcac cactgccaac 2107 

aactcctccc ccagagatct ggctggagcc cagaaaaaga agcatgtggt ttaaaaaatg 2167 

tttaaatcaa tctgtaaaag gtaaaaatga aaaacaaaaa caagcaaaca aacaaaaaac 2227 

aatggaaaag atgaagctgg agagagagga accagttgcc aaggtagaga gctgcccgct 22 87 

cctgccctct ggatgacata ggggacatca acaagacggc tgccaacctg agaagtcacc 2347 

aaaccacaaa aataacctta cagccttcag ggaaagacta ccagctctgt ctttctaccc 2407 

tctaatttaa caatgcataa gagtcaataa accctacttt tttaaaaaaa aaaa 2461 



<210> 6 
<211> 542 
<212> PRT 

<213> Homo sapiens 
<400> 6 



Asp Gin Leu Arg Glu Thr Met His Asn Met Gin Leu Glu Val Asp Leu 
15 10 15 

Leu Lys Ala Glu Asn Asp Arg Leu Lys Val Ala Pro Gly Pro Ser Ser 
20 25 30 

Gly Ser Thr Pro Gly Gin Val Pro Gly Ser Ser Ala Leu Ser Ser Pro 
35 40 45 

Arg Arg Ser Leu Gly Leu Ala Leu Thr His Ser Phe Gly Pro Ser Leu 
50 55 60 

Ala Asp Thr Asp Leu Ser Pro Met Asp Gly He Ser Thr Cys Gly Pro 
65 70 75 80 

\ 

Lys Glu Glu Val Thr Leu Arg Val Val Val Arg Met Pro Pro Gin His 
85 90 95 

He He Lys Gly Asp Leu Lys Gin Gin Glu Phe Phe Leu Gly Cys Ser 
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100 105 HO 

Lys Val Ser Gly Lys Val Asp Trp Lys Met Leu Asp Glu Ala Val Phe 
115 120 125 

Gin Val Phe Lys Asp Tyr lie Ser Lys Met Asp Pro Ala Ser Thr Leu 
130 135 140 

Gly Leu Ser Thr Glu Ser He His Gly Tyr Ser He Ser His Val Lys 
145 150 155 160 

Arg Val Leu Asp Ala Glu Pro Pro Glu Met Pro Pro Cys Arg Arg Gly 
165 170 175 

Val Asn Asn He Ser Val Ser Leu Lys Gly Leu Lys Glu Lys Cys Val 
180 185 190 

Asp Ser Leu Val Phe Glu Thr Leu He Pro Lys Pro Met Met Gin His 
195 200 205 

Tyr He Ser Leu Leu Leu Lys His Arg Arg Leu Val Leu Ser Gly Pro 
210 215 220 

Ser Gly Thr Gly Lys Thr Tyr Leu Thr Asn Arg Leu Ala Glu Tyr Leu 
225 230 235 240 

Val Glu Arg Ser Gly Arg Glu Val Thr Glu Gly He Val Ser Thr Phe 
245 250 255 

Asn Met His Gin Gin Ser Cys Lys Asp Leu Gin Leu Tyr Leu Ser Asn 
260 265 270 

Leu Ala Asn Gin He Asp Arg Glu Thr Gly He Gly Asp Val Pro Leu 
275 280 285 

Val He Leu Leu Asp Asp Leu Ser Glu Ala Gly Ser He Ser Glu Leu 
290 295 300 

Val Asn Gly Ala Leu Thr Cys Lys Tyr His Lys Cys Pro Tyr He He 
305 310 315 320 

Gly Thr Thr Asn Gin Pro Val Lys Met Thr Pro Asn His Gly Leu His 
325 330 335 

Leu Ser Phe Arg Met Leu Thr Phe Ser Asn Asn Val Glu Pro Ala Asn 
340 345 350 

Gly Phe Leu Val Arg Tyr Leu Arg Arg Lys Leu Val Glu Ser Asp Ser 
355 360 365 

Asp He Asn Ala Asn Lys Glu Glu Leu Leu Arg Val Leu Asp Trp Val 
370 375 380 

Pro Lys Leu Trp Tyr His Leu His Thr Phe Leu Glu Lys His Ser Thr 
385 390 395 400 



Ser Asp 



Phe Leu He Gly Pro Cys Phe Phe Leu Ser Cys Pro He Gly 
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405 410 415 

lie Glu Asp Phe Arg Thr Trp Phe He Asp Leu Trp Asn Asn Ser He 
420 425 430 

He Pro Tyr Leu Gin Glu Gly Ala Lys Asp Gly He Lys Val His Gly 
435 440 445 



Gin Lys Ala Ala Trp Glu Asp Pro Val Glu Trp Val Arg Asp Thr Leu 
450 455 460 

Pro Trp Pro Ser Ala Gin Gin Asp Gin Ser Lys Leu Tyr His Leu Pro 
465 470 475 480 

Pro Pro Thr Val Gly Pro His Ser He Ala Ser Pro Pro Glu Asp Arg 
485 490 495 

Thr Val Lys Asp Ser Thr Pro Ser Ser Leu Asp Ser Asp Pro Leu Met 
500 505 510 

Ala Met Leu Leu Lys Leu Gin Glu Ala Ala Asn Tyr He Glu Ser Pro 
515 520 525 

Asp Arg Glu Thr He Leu Asp Pro Asn Leu Gin Ala Thr Leu 
530 535 540 



<210> 7 
<211> 2568 
<212> DNA 
<213> mouse 

<220> 

<221> CDS 

<223> (1) . - (1695) 



<400> 7 

gaa eta tgg gaa aaa gag atg aag etc acg gat ate egg ttg gag gee 
Glu Leu Trp Glu Lys Glu Met Lys Leu Thr Asp He Arg Leu Glu Ala 

etc aac tct gec cac cag ctg gac cag ctt egg gag acc atg cac aat 
Leu Asn Ser Ala His Gin Leu Asp Gin Leu Arg Glu Thr Met His Asn 

atg cag ttg gag gtg gac ctg ctg aaa gca gag aat gac egg ctg aag 
Met Gin Leu Glu Val Asp Leu Leu Lys Ala Glu Asn Asp Arg Leu Lys 

gtt gee ccc ggc ccc tec tea ggc tgc act cca ggg cag gtc cct ggg 
Val Ala Pro Gly Pro Ser Ser Gly Cys Thr Pro Gly Gin Val Pro Gly 



48 



96 



144 



192 
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tea teg get ctg teg tec cct cga cgt tec ctg ggc ctt gca etc age 240 
Ser Ser Ala Leu Ser Ser Pro Arg Arg Ser Leu Gly Leu Ala Leu Ser 

cat cct ttc agt cct agt etc aca gac aca gac etc tea ccc atg gat 288 
His Pro Phe Ser Pro Ser Leu Thr Asp Thr Asp Leu Ser Pro Met Asp 

ggc ate age acc tgt ggt tea aag gaa gag gtg acc ctg egg gtg gtg 336 
Gly lie Ser Thr Cys Gly Ser Lys Glu Glu Val Thr Leu Arg Val Val 

gtc egg atg ccg ccc cag cac ate ate aaa ggg gac tta aag cag cag 3 84 

Val Arg Met Pro Pro Gin His lie lie Lys Gly Asp Leu Lys Gin Gin 

gag ttc ttc ctg ggt tgc age aag gtc agt ggc aaa gtt gac tgg aag 43 2 

Glu Phe Phe Leu Gly Cys Ser Lys Val Ser Gly Lys Val Asp Trp Lys 

atg ctg gat gaa gec gtt ttc caa gtg ttc aag gac tac att tct aaa 480 
Met Leu Asp Glu Ala Val Phe Gin Val Phe Lys Asp Tyr lie Ser Lys 

atg gac cca gee tea acc ctg gga ctg age act gag tec ata cat ggc 528 
Met Asp Pro Ala Ser Thr Leu Gly Leu Ser Thr Glu Ser lie His Gly 



tat age etc age cac gtg aaa cga gtg ctg gat get gag ccc cca gag 576 
Tyr Ser Leu Ser His Val Lys Arg Val Leu Asp Ala Glu Pro Pro Glu 

atg cct cct tgc cgc cga ggt gtc aat aac ata tea gtc get etc aaa 624 
Met Pro Pro Cys Arg Arg Gly Val Asn Asn lie Ser Val Ala Leu Lys 

ggt ctg aaa gag aag tgt gtc gac age ctg gtg ttc gag acg ctt ate 672 
Gly Leu Lys Glu Lys Cys Val Asp Ser Leu Val Phe Glu Thr Leu lie 

ccc aag ccc atg atg cag cac tac ate age etc ctg etc aag cac egg 720 
Pro Lys Pro Met Met Gin His Tyr lie Ser Leu Leu Leu Lys His Arg 

cgc ctg gtg etc tec ggc ccc agt ggc acc ggc aag acc tac ttg acc 768 
Arg Leu Val Leu Ser Gly Pro Ser Gly Thr Gly Lys Thr Tyr Leu Thr 

aat egg eta gee gag tac ctg gtg gag cgc tec ggc cgc gag gtc acg 816 
Asn Arg Leu Ala Glu Tyr Leu Val Glu Arg Ser Gly Arg Glu Val Thr 

gat ggc ate gtc age act ttc aac atg cac cag cag tct tgc aag gat 864 
Asp Gly He Val Ser Thr Phe Asn Met His Gin Gin Ser Cys Lys Asp 
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ctg caa ctg tac etc tec aac eta gee aac cag ata gac egg gaa aca 
Leu Gin Leu Tyr Leu Ser Asn Leu Ala Asn Gin lie Asp Arg Glu Thr 



cac aaa tgt ccc tac att ata ggt ace acc aat cag cct gta aaa atg 
His Lys Cys Pro Tyr lie lie Gly Thr Thr Asn Gin Pro Val Lys Met 



aag ttg gta gag tea gac agt gac gtc aat get aac aag gaa gag ctg 
Lys Leu Val Glu Ser Asp Ser Asp Val Asn Ala Asn Lys Glu Glu Leu 



ttc ctg tec tgt ccc att ggc ate gag gac ttc egg acc tgg ttc att 1344 
Phe Leu Ser Cys Pro lie Gly lie Glu Asp Phe Arg Thr Trp Phe lie 



gat ggg ate aag gtt cat gga cag aaa get get tgg gaa gac ccg gtg 
Asp Gly lie Lys Val His Gly Gin Lys Ala Ala Trp Glu Asp Pro Val 



gaa tgg gtc cga gac act ctt ccc tgg ccg teg gec caa caa gac caa 
Glu Trp Val Arg Asp Thr Leu Pro Trp Pro Ser Ala Gin Gin Asp Gin 



912 



ggg ata ggg gat gtg ccc ttg gtg ate etc ctg gat gat ctg agt gaa 960 
Gly lie Gly Asp Val Pro Leu Val lie Leu Leu Asp Asp Leu Ser Glu 

gca ggc tec ate agt gag ctg gtc aat ggg gec etc acc tgc aag tat 1008 
Ala Gly Ser lie Ser Glu Leu Val Asn Gly Ala Leu Thr Cys Lys Tyr 



1056 



aca ccc aac cat ggc ttg cac ttg age ttc agg atg ctg acc ttc teg 1104 

Thr Pro Asn His Gly Leu His Leu Ser Phe Arg Met Leu Thr Phe Ser 

aac aat gtg gaa cca gec aat ggc ttt ctg gtc cgt tac ctg egg agg 1152 

Asn Asn Val Glu Pro Ala Asn Gly Phe Leu Val Arg Tyr Leu Arg Arg 



1200 



ctt egg gtg ctg gac tgg gtg ccc aag ctg tgg tat cac etc cac acc 124 8 

Leu Arg Val Leu Asp Trp Val Pro Lys Leu Trp Tyr His Leu His Thr 

ttc ctg gag aag cac age acc teg gac ttc etc att ggc cct tgc ttc 12 96 

Phe Leu Glu Lys His Ser Thr Ser Asp Phe Leu lie Gly Pro Cys Phe 



gac ctg tgg aac aat tec ate ate ccc tat eta cag gaa gga gec aag 1392 
Asp Leu Trp Asn Asn Ser lie lie Pro Tyr Leu Gin Glu Gly Ala Lys 



1440 



1488 



tea aag etc tac cac ctg ccc ccg cct tct gtg ggc ccc cac age act 1536 
Ser Lys Leu Tyr His Leu Pro Pro Pro Ser Val Gly Pro His Ser Thr 
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gcc tea ccc ccg gag gac agg aca gtc aaa gac age act cca aac tec 
Ala Ser Pro Pro Glu Asp Arg Thr Val Lys Asp Ser Thr Pro Asn Ser 



1584 



etc gac tea gat ccc ctg atg gcc atg eta ctg aaa etc caa gaa get 
Leu Asp Ser Asp Pro Leu Met Ala Met Leu Leu Lys Leu Gin Glu Ala 



1632 



gcc aac tac att gag tea cca gat cga gag act ate ctg gac ccc aac 
Ala Asn Tyr lie Glu Ser Pro Asp Arg Glu Thr He Leu Asp Pro Asn 



1680 



etc cag gcg aca etc tgagggcccg gcagtcactg tcaccctgga gggcagaagg 
Leu Gin Ala Thr Leu 



1735 



ctggcttcag 


catcattagc 


tctcctctgc 


cctcttcctt 


catagctctg 


gctcaccagc 


1795 


ctcgccaaga 


gaacaggagg 


yadyday dyy 


yL-dyyctyycty 




t" cci<nY~ nrh era 


1855 


acctttgaga 


acttcctact 


aggaattgga 


gggggtggag 


tttgagaact 


ccgtgcccct 


1915 


taactacatt 


tgctggcctc 


ctcttacgac 


ttaggagaaa 


agatgattct 


ggtcttttct 


1975 


tcaagttttg 


tttcacctac 


aaactcttgg 


gctttctggg 


gagggattcg 


gaagatataa 


2035 


acagacaaac 


aaaaacaaac 


aaaccaacta 


cagcagttcc 


aagctcgttc 


tcacaaacac 


2095 


ctctgagaca 


gtcacatgtg 


ggcaaatcta 


agggaggcag 


gaagctctac 


agactttctt 


2155 


gcaaaccctt 


cccagttctg 


tcgacactgc 


caacaacctc 


cccgccagag 


acctggccag 


2215 


agecaagaaa 


agagaagcat 


gtggtttaac 


agaaaaacaa 


aacaaaacaa 


aacaaaaaat 


2275 


atatgtgtaa 


atcaacctgt 


agaaggtaaa 


aacggcaatg 


gaaaagatga 


agctggaagg 


2335 


aggggeccag 


ttgecaagat 


ggaacgagag 


ctgccagatc 


ttgccttctg 


gatgacaaga 


2395 


ggggacattg 


caagatggct 


gecagtctaa 


aacgtcacca 


gaccacaaga 


gtaacatcac 


2455 


agecttcgaa 


gaaaggecac 


aagctgtctt 


tctgccctct 


aactgaacat 


gcatgaaaag 


2515 


tcaataaacc 


ctacttttta 


atttttaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaa 


2568 



<210> 8 

<211> 565 

<212> PRT 

<213> mouse 

<400> 8 

Glu Leu Trp Glu Lys Glu Met Lys Leu Thr Asp He Arg Leu Glu Ala 
15 10 15 

Leu Asn Ser Ala His Gin Leu Asp Gin Leu Arg Glu Thr Met His Asn 
20 25 30 
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Met Gin Leu Glu Val Asp Leu Leu Lys Ala Glu Asn Asp Arg Leu Lys 
35 40 45 

Val Ala Pro Gly Pro Ser Ser Gly Cys Thr Pro Gly Gin Val Pro Gly 
50 55 60 

Ser Ser Ala Leu Ser Ser Pro Arg Arg Ser Leu Gly Leu Ala Leu Ser 
65 70 75 80 

His Pro Phe Ser Pro Ser Leu Thr Asp Thr Asp Leu Ser Pro Met Asp 
85 90 95 

Gly lie Ser Thr Cys Gly Ser Lys Glu Glu Val Thr Leu Arg Val Val 
100 105 110 

Val Arg Met Pro Pro Gin His lie lie Lys Gly Asp Leu Lys Gin Gin 
115 120 125 

Glu Phe Phe Leu Gly Cys Ser Lys Val Ser Gly Lys Val Asp Trp Lys 
130 135 140 

Met Leu Asp Glu Ala Val Phe Gin Val Phe Lys Asp Tyr lie Ser Lys 
145 150 155 160 

Met Asp Pro Ala Ser Thr Leu Gly Leu Ser Thr Glu Ser lie His Gly 
165 170 175 

Tyr Ser Leu Ser His Val Lys Arg Val Leu Asp Ala Glu Pro Pro Glu 
180 185 190 

Met Pro Pro Cys Arg Arg Gly Val Asn Asn lie Ser Val Ala Leu Lys 
195 200 205 

Gly Leu Lys Glu Lys Cys Val Asp Ser Leu Val Phe Glu Thr Leu lie 
210 215 220 

Pro Lys Pro Met Met Gin His Tyr lie Ser Leu Leu Leu Lys His Arg 
225 230 235 240 

Arg Leu Val Leu Ser Gly Pro Ser Gly Thr Gly Lys Thr Tyr Leu Thr 
245 250 255 

Asn Arg Leu Ala Glu Tyr Leu Val Glu Arg Ser Gly Arg Glu Val Thr 
260 265 270 

Asp Gly lie Val Ser Thr Phe Asn Met His Gin Gin Ser Cys Lys Asp 
275 280 285 



Leu Gin Leu Tyr Leu Ser Asn Leu Ala Asn Gin lie Asp Arg Glu Thr 
290 295 300 

Gly lie Gly Asp Val Pro Leu Val lie Leu Leu Asp Asp Leu Ser Glu 

305 310 315 320 

Ala Gly Ser lie Ser Glu Leu Val Asn Gly Ala Leu Thr Cys Lys Tyr 
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325 330 335 

His Lys Cys Pro Tyr lie lie Gly Thr Thr Asn Gin Pro Val Lys Met 
340 345 350 

Thr Pro Asn His Gly Leu His Leu Ser Phe Arg Met Leu Thr Phe Ser 
355 360 365 

Asn Asn Val Glu Pro Ala Asn Gly Phe Leu Val Arg Tyr Leu Arg Arg 
370 375 380 

Lys Leu Val Glu Ser Asp Ser Asp Val Asn Ala Asn Lys Glu Glu Leu 
385 390 395 400 

Leu Arg Val Leu Asp Trp Val Pro Lys Leu Trp Tyr His Leu His Thr 
405 410 415 

Phe Leu Glu Lys His Ser Thr Ser Asp Phe Leu lie Gly Pro Cys Phe 
420 425 430 

Phe Leu Ser Cys Pro lie Gly lie Glu Asp Phe Arg Thr Trp Phe lie 
435 440 445 

Asp Leu Trp Asn Asn Ser lie lie Pro Tyr Leu Gin Glu Gly Ala Lys 
450 455 460 

Asp Gly lie Lys Val His Gly Gin Lys Ala Ala Trp Glu Asp Pro Val 
465 470 475 480 

Glu Trp Val Arg Asp Thr Leu Pro Trp Pro Ser Ala Gin Gin Asp Gin 
485 490 495 

Ser Lys Leu Tyr His Leu Pro Pro Pro Ser Val Gly Pro His Ser Thr 
500 505 510 

Ala Ser Pro Pro Glu Asp Arg Thr Val Lys Asp Ser Thr Pro Asn Ser 
515 520 525 

Leu Asp Ser Asp Pro Leu Met Ala Met Leu Leu Lys Leu Gin Glu Ala 
530 535 540 

Ala Asn Tyr lie Glu Ser Pro Asp Arg Glu Thr lie Leu Asp Pro Asn 
545 550 555 560 

Leu Gin Ala Thr Leu 
565 



<210> 9 

<211> 1025 

<212> DNA 

<213> mouse 

<220> 

<221> CDS 

<223> (1) . . (129) 
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<400> 9 



cca ata gaa etc egg ate aag agg cag aat tec tea gat age ate tec 48 
Pro lie Glu Leu Arg lie Lys Arg Gin Asn Ser Ser Asp Ser lie Ser 



age etc aac age ate ace age cat tec age ate ggc age age aaa gat 96 
Ser Leu Asn Ser lie Thr Ser His Ser Ser lie Gly Ser Ser Lys Asp 

get gat gec aag aag aaa aag aag aag agt tgg gtatgtaaag gcttggggat 14 9 

Ala Asp Ala Lys Lys Lys Lys Lys Lys Ser Trp 



cggcctgtgc 


taggagtcac 


tcaccctgtt 


gcagggaact 


gacccctttc 


aggatcaaca 


209 


aagagggtcc 


cttctaacag 


gatgecagtg 


ttgtgacatc 


tgctggggac 


aaaaattcac 


269 


taagttccca 


ttcctctatc 


cattgtctat 


tctccttacc 


accgccctgc 


acatataccc 


329 


cagcccccca 


ccgtccctgc 


atcctttata 


catgtctget 


atcctggggc 


tctacctact 


389 


gatgaggtca 


aatgtatttg 


geegtagaag 


gagctgagaa 


aattattcat 


gggtgggaga 


449 


gtggggcatg 


tggagagaat 


ttgtaagcca 


agcagggtac 


tetagacget 


cctggggctg 


509 


ttgctttagt 


ttgggtgagg 


aggctgtgga 


acgtccccat. 


cgctccaaag 


ectgettttg 


569 


tctggtccag 


aggtgggttt 


gttctgtgtg 


gtatcccccc 


tgtaactcta 


aactggcttt 


629 


gggtgagctt 


tctacaatct 


gtacgcaggt 


gtagggcact 


gectgactga 


ctgaaaggga 


689 


gagtgaccca 


gagtgagcgt 


cttgtccctg 


tccctgctga 


ggagggctgg 


ctacagactt 


749 


tggcctagtg 


cagacaggag 


ccagctgtgt 


ggagaagcag 


ctgtgtgaaa 


tgcatgagta 


809 


gtgtcgctgc 


tgetgetget 


getgetttet 


tttcattgtt 


tttttttttt 


tttctttcct 


869 


tttatttcct 


teaaaatget 


gacctcaaat 


ccctattttt 


tttccaggtt 


tatgaggtaa 


929 


gaaactcggt 


tcctctcctc 


gtgettttte 


tttccctttg 


cacaccttcg 


tgtttccaga 


989 


gcaagcacct 


ctcttaaaaa 


aaaaaaaaaa 


aaaaaa 






1025 



<210> 10 
<211> 43 
<212> PRT 
<213> mouse 

<400> 10 

Pro lie Glu Leu Arg He Lys Arg Gin Asn Ser Ser Asp Ser He Ser 
1 5 10 15 

Ser Leu Asn Ser He Thr Ser His Ser Ser He Gly Ser Ser Lys Asp 
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20 25 30 

Ala Asp Ala Lys Lys Lys Lys Lys Lys Ser Trp 
35 40 



<210> 11 

<211> 8690 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<223> (503) . . (6187) 

<400> 11 



ctctcagcgg cccaagcagt ttctttctgg gtgacaagaa tgtgcctcgg ttggtttttc 60 

ttttttttct ccatctcctt aagacgattt ccatagtaac ctgatcaagt ggctcaaaat 120 

cgcaaacctg aggatttccg cggcccgccg gcaagacctc ggccaggttt acaggaaatc 180 

tgtcattttt tattaaaatg gaaaactgtg aagaaagaaa aagatagcag ttgaagtcaa 240 

aattctcgga tgactatttt gcttttgagg agtcagcatt taaaaacgat atgctgattt 300 

ggaaggtcct gggagtaaac tgcaaacttt attttttcca ttcaatcaat ggatttttta 360 

atcattcctt ggagtcgatg aagttcggaa acggtgtgtg atggggaacg tggcgggcca 420 

gtgtgttcct agaaattgca tcttggatta gtttgctgct tttttgaaga gattccattt 480 

tgaagggcaa gaacctaatg tg atg gat tta tct tea gaa atg aac aga cat 532 

Met Asp Leu Ser Ser Glu Met Asn Arg His 

ggg aag aat cca gtg agt cac aag eta gaa gat cag aag aag att tac 580 
Gly Lys Asn Pro Val Ser His Lys Leu Glu Asp Gin Lys Lys lie Tyr 

act gac tgg gec aac cac tac eta gca aaa tea ggc cac aag egg ctg 62 8 
Thr Asp Trp Ala Asn His Tyr Leu Ala Lys Ser Gly His Lys Arg Leu 

ate aag gac ttg caa caa gac att gca gat gga gta etc eta gca gaa 676 
lie Lys Asp Leu Gin Gin Asp lie Ala Asp Gly Val Leu Leu Ala Glu 

ate ate cag att att gca aat gaa aaa gtt gaa gat ate aat gga tgt 724 
lie lie Gin lie lie Ala Asn Glu Lys Val Glu Asp lie Asn Gly Cys 

cct aga agt cag tct cag atg att gaa aat gtt gat gtc tgc ctt agt 772 
Pro Arg Ser Gin Ser Gin Met He Glu Asn Val Asp Val Cys Leu Ser 
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ttt eta gca gec aga ggg gta aat gtt caa ggt eta tct get gaa gaa 820 
Phe Leu Ala Ala Arg Gly Val Asn Val Gin Gly Leu Ser Ala Glu Glu 

ata aga aat gga aac tta aaa gee att eta ggg ctg ttt ttc agt tta 868 
lie Arg Asn Gly Asn Leu Lys Ala lie Leu Gly Leu Phe Phe Ser Leu 

tct cgc tac aag cag caa caa cac cat caa caa cag tac tat cag tec 916 
Ser Arg Tyr Lys Gin Gin Gin His His Gin Gin Gin Tyr Tyr Gin Ser 

ttg gtg gaa ctt cag cag cga gtt act cac get tec cct cca teg gaa 964 
Leu Val Glu Leu Gin Gin Arg Val Thr His Ala Ser Pro Pro Ser Glu 

gee age cag gee aaa acc cag caa gat atg cag tec agg ctt cca ggg 1012 
Ala Ser Gin Ala Lys Thr Gin Gin Asp Met Gin Ser Arg Leu Pro Gly 



ccc tct agg gtg cct get gca gga age age age aag gtc cag gga gee 1060 
Pro Ser Arg Val Pro Ala Ala Gly Ser Ser Ser Lys Val Gin Gly Ala 

tct aat tta aat agg aga agt cag age ttt aac age att gac aaa aac 1108 
Ser Asn Leu Asn Arg Arg Ser Gin Ser Phe Asn Ser lie Asp Lys Asn 

aag cct cca aat tat gca aat gga aac gaa aaa ggt gaa gac cct gaa 1156 
Lys Pro Pro Asn Tyr Ala Asn Gly Asn Glu Lys Gly Glu Asp Pro Glu 

aca aga aga atg aga aca gtt aaa aac ata gca gac ttg agg cag aat 1204 
Thr Arg Arg Met Arg Thr Val Lys Asn lie Ala Asp Leu Arg Gin Asn 

tta gaa gag act atg tec agt ctt cgt ggg act cag ata age cac age 1252 
Leu Glu Glu Thr Met Ser Ser Leu Arg Gly Thr Gin lie Ser His Ser 

acc ctg gag aca aca ttt gac age act gtg aca aca gaa gtt aat gga 1300 
Thr Leu Glu Thr Thr Phe Asp Ser Thr Val Thr Thr Glu Val Asn Gly 

agg acc ata ccc aac ttg aca agt cga ccc acc ccc atg acc tgg agg 1348 
Arg Thr lie Pro Asn Leu Thr Ser Arg Pro Thr Pro Met Thr Trp Arg 

ttg ggc cag gca tgt ccg cga ctt cag gcg gga gat get ccc tec ctg 13 96 

Leu Gly Gin Ala Cys Pro Arg Leu Gin Ala Gly Asp Ala Pro Ser Leu 

ggt get ggc tat cct cgc agt ggt acc agt cga ttc ate cac aca gac 1444 
Gly Ala Gly Tyr Pro Arg Ser Gly Thr Ser Arg Phe lie His Thr Asp 
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ccc teg agg ttc atg tat acc acg cct etc cgt cga get get gtc tct 
Pro Ser Arg Phe Met Tyr Thr Thr Pro Leu Arg Arg Ala Ala Val Ser 



tac atg aca gat gga gga ctt aac eta tat act aga agt ctg aac cga 
Tyr Met Thr Asp Gly Gly Leu Asn Leu Tyr Thr Arg Ser Leu Asn Arg 



gac age cat ggg gat get ggt ggc aag tgg aag act gtg tec tct gga 

Asp Ser His Gly Asp Ala Gly Gly Lys Trp Lys Thr Val Ser Ser Gly 

ctt cct gaa gac ccc gag aag gca ggg cag aaa get tec ctg tct gtt 

Leu Pro Glu Asp Pro Glu Lys Ala Gly Gin Lys Ala Ser Leu Ser Val 



1492 



agg ctg gga aac atg tea cag att gac atg agt gag aaa gca age agt 154 0 

Arg Leu Gly Asn Met Ser Gin lie Asp Met Ser Glu Lys Ala Ser Ser 

gac ctg gac atg tct tct gag gtc gat gtg ggt gga tat atg agt gat 1588 

Asp Leu Asp Met Ser Ser Glu Val Asp Val Gly Gly Tyr Met Ser Asp 

ggt gat ate ctt ggg aaa agt etc agg act gat gac ate aac agt ggg 1636 

Gly Asp lie Leu Gly Lys Ser Leu Arg Thr Asp Asp lie Asn Ser Gly 



1684 



ata cca gac aca gca act tec egg gac ate ate cag aga ggg gtt cac 1732 
lie Pro Asp Thr Ala Thr Ser Arg Asp lie lie Gin Arg Gly Val His 

gat gtg aca gtg gat gca gac age tgg gat gac age agt tea gtg age 1780 
Asp Val Thr Val Asp Ala Asp Ser Trp Asp Asp Ser Ser Ser Val Ser 

agt ggt etc agt gac acc ctt gat aac ate age act gat gac ctg aac 
Ser Gly Leu Ser Asp Thr Leu Asp Asn lie Ser Thr Asp Asp Leu Asn 

acc aca tec tct gtc age tct tac tec aac ate acc gtc ccc tct agg 
Thr Thr Ser Ser Val Ser Ser Tyr Ser Asn lie Thr Val Pro Ser Arg 

aag aat act cag ctg agg aca gat tea gag aaa cgc tec acc aca gac 
Lys Asn Thr Gin Leu Arg Thr Asp Ser Glu Lys Arg Ser Thr Thr Asp 

gag acc tgg gat agt cct gag gaa ctg aaa aaa cca gaa gaa gat ttt 1972 
Glu Thr Trp Asp Ser Pro Glu Glu Leu Lys Lys Pro Glu Glu Asp Phe 



1828 



1876 



1924 



2020 



2068 



tea cag aca ggt tec tgg aga aga ggc atg tct gee caa gga ggg gcg 2116 
Ser Gin Thr Gly Ser Trp Arg Arg Gly Met Ser Ala Gin Gly Gly Ala 
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cca tct agg cag aaa get gga aca agt 
Pro Ser Arg Gin Lys Ala Gly Thr Ser 



acc gat gat gec aaa get tct gag aaa 
Thr Asp Asp Ala Lys Ala Ser Glu Lys 



tea tct eta caa aga tct cct tea gat 
Ser Ser Leu Gin Arg Ser Pro Ser Asp 



gaa ggg aaa aag ccc ccc tea ggc att 
Glu Gly Lys Lys Pro Pro Ser Gly lie 



tec ttt ggc ttt aag aaa cca agt gga 
Ser Phe Gly Phe Lys Lys Pro Ser Gly 



acc age agt gga gca acc ata aca agt 
Thr Ser Ser Gly Ala Thr lie Thr Ser 



att cca aaa tct get gee att ggc ggg 
lie Pro Lys Ser Ala Ala lie Gly Gly 



acc agt ttg gac ggt tea cag aat cag 
Thr Ser Leu Asp Gly Ser Gin Asn Gin 



age tea aag act acc eta caa tat cgc 
Ser Ser Lys Thr Thr Leu Gin Tyr Arg 



tec age acc agt ggc att cct ggc cga 
Ser Ser Thr Ser Gly lie Pro Gly Arg 



age agt att gat tec aac gtc age age 
Ser Ser lie Asp Ser Asn Val Ser Ser 



teg aaa ctg aga gaa cca act aaa att 
Ser Lys Leu Arg Glu Pro Thr Lys lie 



gtc acc gtc aac caa aca gac aag gaa 
Val Thr Val Asn Gin Thr Asp Lys Glu 



gat tea gaa agt gtt tct ttg tea ggt 
Asp Ser Glu Ser Val Ser Leu Ser Gly 



gca etc aaa aca ccc ggg aaa 2164 
Ala Leu Lys Thr Pro Gly Lys 



gga aaa get ccc eta aaa gga 2212 
Gly Lys Ala Pro Leu Lys Gly 



gca gga aaa age agt gga gat 2260 
Ala Gly Lys Ser Ser Gly Asp 



gga aga teg act gee acc age 2308 
Gly Arg Ser Thr Ala Thr Ser 



gta ggg tea tct gee atg ate 2356 
Val Gly Ser Ser Ala Met lie 



ggc tct gca aca ctg ggt aaa 2404 
Gly Ser Ala Thr Leu Gly Lys 



aag tea aat gca ggg aga aaa 2452 
Lys Ser Asn Ala Gly Arg Lys 



gat gat gtt gtg ctg cat gtt 2500 
Asp Asp Val Val Leu His Val 

age ttg ccc cgc cct tea aaa 2548 
Ser Leu Pro Arg Pro Ser Lys 



gga ggc cac aga tec agt acc 2596 
Gly Gly His Arg Ser Ser Thr 



aag tct get ggg gec acc acc 2644 
Lys Ser Ala Gly Ala Thr Thr 



ggg tea ggg cgc teg agt cct 2692 
Gly Ser Gly Arg Ser Ser Pro 



aag gaa aaa gta gca gtc tea 2740 
Lys Glu Lys Val Ala Val Ser 

tec ccc aaa tec age ccc acc 2788 
Ser Pro Lys Ser Ser Pro Thr 
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tct gcc age gec tgt ggt gca caa ggt 
Ser Ala Ser Ala Cys Gly Ala Gin Gly 



tat cca gat att gcc tea ccc aca ttt 
Tyr Pro Asp lie Ala Ser Pro Thr Phe 



gca ggt ggc aaa tct gcc tct gca cct 
Ala Gly Gly Lys Ser Ala Ser Ala Pro 



tec tea gta atg ccc age cct agt ace 
Ser Ser Val Met Pro Ser Pro Ser Thr 



ctg gag tea ccg teg tec ggt acg ggc 
Leu Glu Ser Pro Ser Ser Gly Thr Gly 



eta age ggc age age age cct etc ttc 
Leu Ser Gly Ser Ser Ser Pro Leu Phe 



aca gat gtt ata age tta agt cac teg 
Thr Asp Val lie Ser Leu Ser His Ser 



gtt cac tct ttc aca tea ggt ggt etc 
Val His Ser Phe Thr Ser Gly Gly Leu 



agt tec tct gca ggc age aag gat act 
Ser Ser Ser Ala Gly Ser Lys Asp Thr 



age etc cac acg age tct gag tec att 
Ser Leu His Thr Ser Ser Glu Ser lie 



ggc tec ttg tct gga ctg ace aca ggc 
Gly Ser Leu Ser Gly Leu Thr Thr Gly 



etc atg aga acg ggt agt gtg aga tct 
Leu Met Arg Thr Gly Ser Val Arg Ser 



ctt gac aga aat aca eta ccc aaa aag 
Leu Asp Arg Asn Thr Leu Pro Lys Lys 



tct egg cag gcc aac caa gaa gag ggc 
Ser Arg Gin Ala Asn Gin Glu Glu Gly 



tct act gga ggg ctt cag gac act ggc 



etc agg cag cca gga tec aag 2836 
Leu Arg Gin Pro Gly Ser Lys 



cga agg ttg ttt ggt gcc aag 2884 
Arg Arg Leu Phe Gly Ala Lys 



aat act gag ggt gtg aaa tct 2932 
Asn Thr Glu Gly Val Lys Ser 



aca tta gcg egg caa ggc agt 2 98 0 

Thr Leu Ala Arg Gin Gly Ser 



age atg ggc agt get ggt ggg 3 02 8 

Ser Met Gly Ser Ala Gly Gly 



aat aaa ccc tea gac tta act 3076 
Asn Lys Pro Ser Asp Leu Thr 



ttg gcc tec age cca gca teg 3124 
Leu Ala Ser Ser Pro Ala Ser 



gtg tgg get gcc aat atg age 3172 
Val Trp Ala Ala Asn Met Ser 



ccg age tac cag tec atg act 3220 
Pro Ser Tyr Gin Ser Met Thr 



gac etc ccc etc age cat cat 3268 
Asp Leu Pro Leu Ser His His 



act cac gag gtc cag age ctg 3316 
Thr His Glu Val Gin Ser Leu 



act etc tea gaa age atg cag 3364 
Thr Leu Ser Glu Ser Met Gin 



gga eta aga tat ace cca tea 3412 
Gly Leu Arg Tyr Thr Pro Ser 



aaa gag tgg ttg cgt tct cat 3460 
Lys Glu Trp Leu Arg Ser His 



aac cag tea cct ctg gtt tec 3508 
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Ser Thr Gly Gly Leu Gin Asp Thr Gly 



cct tct gcc atg tea tct tct gca get 
Pro Ser Ala Met Ser Ser Ser Ala Ala 



ttg gtg age cca aca aat ttg tct caa 
Leu Val Ser Pro Thr Asn Leu Ser Gin 



atg atg cgc tea aac age ate cca gcc 
Met Met Arg Ser Asn Ser lie Pro Ala 



tat gat gac tec cag ctt tgt ggg agt 
Tyr Asp Asp Ser Gin Leu Cys Gly Ser 



cct cgt gcc ate agt cat teg ggc tea 
Pro Arg Ala lie Ser His Ser Gly Ser 



gtt cat ggc tct tea tta tea ctg gtg 
Val His Gly Ser Ser Leu Ser Leu Val 



tct aca get gaa gaa aag get cat tea 
Ser Thr Ala Glu Glu Lys Ala His Ser 



aga gag ctg gtt gca tea caa gaa aaa 
Arg Glu Leu Val Ala Ser Gin Glu Lys 



ctt tea gca aat get cac ctt gta gca 
Leu Ser Ala Asn Ala His Leu Val Ala 



aat atg act ggc cga ttg caa agt eta 
Asn Met Thr Gly Arg Leu Gin Ser Leu 



gaa tct gaa ctt ata gaa eta aga gaa 
Glu Ser Glu Leu lie Glu Leu Arg Glu 



cag aat tct get gcc cag gcg get att 
Gin Asn Ser Ala Ala Gin Ala Ala lie 



gac cat cct ccc aaa gat ctt cgc ate 
Asp His Pro Pro Lys Asp Leu Arg lie 



agt gtt tct agt ate aac agt gcc aca 
Ser Val Ser Ser lie Asn Ser Ala Thr 



Asn Gin Ser Pro Leu Val Ser 

gga aaa tac cac ttt tct aac 3556 
Gly Lys Tyr His Phe Ser Asn 

ttt aac ctt ccc ggg ccc age 3604 
Phe Asn Leu Pro Gly Pro Ser 

caa gac tct tec ttc gat etc 3652 
Gin Asp Ser Ser Phe Asp Leu 

gcc act tct ctg gag gaa aga 3700 
Ala Thr Ser Leu Glu Glu Arg 

ttc aga gac age atg gaa gaa 374 8 

Phe Arg Asp Ser Met Glu Glu 

tec age act tct tct ctt tac 3796 
Ser Ser Thr Ser Ser Leu Tyr 

gag caa ate cat aaa ctg egg 3844 
Glu Gin lie His Lys Leu Arg 

gtt get acc etc aca tct cag 3892 
Val Ala Thr Leu Thr Ser Gin 

get ttt gaa aag age tta ggg 3 94 0 

Ala Phe Glu Lys Ser Leu Gly 

act atg aca gcg gaa caa aag 3988 
Thr Met Thr Ala Glu Gin Lys 

acc att gaa atg ctg aag get 4036 
Thr lie Glu Met Leu Lys Ala 

cag gga gca ctg aat ggt cca 4084 
Gin Gly Ala Leu Asn Gly Pro 

aga aga cag cat tec tct gaa 4132 
Arg Arg Gin His Ser Ser Glu 

age cat tec agt att ggc agt 4180 
Ser His Ser Ser lie Gly Ser 
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ggt aat gat gcc gac tec aag aag aag aaa aag aaa aac tgg ctg aga 422 8 

Gly Asn Asp Ala Asp Ser Lys Lys Lys Lys Lys Lys Asn Trp Leu Arg 



agt tct ttc aaa caa gcc ttt ggg aag aaa aag tec acc aag cct cct 
Ser Ser phe Lys Gin Ala Phe Gly Lys Lys Lys Ser Thr Lys Pro Pro 



tea tea cat tct gac att gaa gag ctt act gat tea tec ctt ccg gca 
Ser Ser His Ser Asp lie Glu Glu Leu Thr Asp Ser Ser Leu Pro Ala 



aag ccc tea caa tct get tea gcg ate tgt gaa tgc aca gaa get gag 
Lys Pro Ser Gin Ser Ala Ser Ala lie Cys Glu Cys Thr Glu Ala Glu 



aaa tta acg gat att egg ctg gag gcc etc age tct get cat cat ctt 
Lys Leu Thr Asp lie Arg Leu Glu Ala Leu Ser Ser Ala His His Leu 



aag cct act egg cca ccg tea gaa tec tea age age acc tec tct tea 
Lys Pro Thr Arg Pro Pro Ser Glu Ser Ser Ser Ser Thr Ser Ser Ser 



tct tec agg cag tea tta gga ctt tct eta aac aat ttg aac ate aca 
Ser Ser Arg Gin Ser Leu Gly Leu Ser Leu Asn Asn Leu Asn lie Thr 



gga cat aaa gat ggc cgc agt gtg aaa att ata gtc tec ata age aag 
Gly His Lys Asp Gly Arg Ser Val Lys lie lie Val Ser lie Ser Lys 



ggc tat ggt cga gca aag gac caa aaa tct cag gca tat ttg ata gga 
Gly Tyr Gly Arg Ala Lys Asp Gin Lys Ser Gin Ala Tyr Leu lie Gly 



4276 



4324 



tec ccc aag tta ccc cat aat get ggt gac tgt ggc tea gca tec atg 4372 
Ser Pro Lys Leu Pro His Asn Ala Gly Asp Cys Gly Ser Ala Ser Met 



4420 



gca gag ata att ctg cag ctg aag age gag etc aga gaa aag gaa tta 4468 
Ala Glu lie lie Leu Gin Leu Lys Ser Glu Leu Arg Glu Lys Glu Leu 



4516 



gat cag ate egg gaa gcc atg aac egg atg cag aat gaa att gaa ata 4564 
Asp Gin lie Arg Glu Ala Met Asn Arg Met Gin Asn Glu lie Glu lie 

ctg aaa get gaa aat gac egg ttg aag gca gaa act ggt aac aca get 4612 
Leu Lys Ala Glu Asn Asp Arg Leu Lys Ala Glu Thr Gly Asn Thr Ala 



4660 



4708 



gag get gtt age tea gat att ttg eta gat gat get ggt gat gca act 4756 
Glu Ala Val Ser Ser Asp lie Leu Leu Asp Asp Ala Gly Asp Ala Thr 



4804 



4852 
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tec att ggt gtt agt gga aaa acc aag tgg gat gtc tta gat ggt gta 
Ser lie Gly Val Ser Gly Lys Thr Lys Trp Asp Val Leu Asp Gly Val 



ata aga cgt etc ttt aag gaa tat gta ttc cga att gat aca tec act 
lie Arg Arg Leu Phe Lys Glu Tyr Val Phe Arg lie Asp Thr Ser Thr 



age ctt ggt ctg age tct gac tgc att get age tac tgt ata gga gac 
Ser Leu Gly Leu Ser Ser Asp Cys lie Ala Ser Tyr Cys lie Gly Asp 



gaa gaa aat agt ttg gac agt ttt gtt ttt gat acg ctg att cct aaa 
Glu Glu Asn Ser Leu Asp Ser Phe Val Phe Asp Thr Leu lie Pro Lys 



gtg gag etc cca gtt gta ata att ctt gat aat ctt cat cat gtg ggc 
Val Glu Leu Pro Val Val lie lie Leu Asp Asn Leu His His Val Gly 



4900 



4948 



4996 



tta att aga tec cat aac eta gaa gtg cct gaa ttg ctg cct tgt gga 5044 
Leu lie Arg Ser His Asn Leu Glu Val Pro Glu Leu Leu Pro Cys Gly 

tac ctt gtt gga gat aat aac ate ate act gtg aac etc aaa ggg gta 5092 
Tyr Leu Val Gly Asp Asn Asn lie lie Thr Val Asn Leu Lys Gly Val 



5140 



cca att acc caa agg tac ttt aac ttg ttg atg gag cat cac aga att 5188 
Pro lie Thr Gin Arg Tyr Phe Asn Leu Leu Met Glu His His Arg lie 

ata etc tea gga ccg agt ggt act gga aag acc tat ttg gca aac aaa 5236 
lie Leu Ser Gly Pro Ser Gly Thr Gly Lys Thr Tyr Leu Ala Asn Lys 

ctt get gaa tat gta ata acc aaa tct gga agg aaa aaa aca gag gat 5284 
Leu Ala Glu Tyr Val lie Thr Lys Ser Gly Arg Lys Lys Thr Glu Asp 

gca att gee act ttt aat gtg gac cac aag tea agt aag gaa ttg caa 5332 
Ala lie Ala Thr Phe Asn Val Asp His Lys Ser Ser Lys Glu Leu Gin 

caa tat eta get aac ctg get gaa cag tgc agt get gat aat aat gga 53 80 

Gin Tyr Leu Ala Asn Leu Ala Glu Gin Cys Ser Ala Asp Asn Asn Gly 



5428 



tct ctg agt gat ate ttc aat ggt ttt etc aat tgt aaa tac aac aaa 5476 
Ser Leu Ser Asp lie Phe Asn Gly Phe Leu Asn Cys Lys Tyr Asn Lys 

tgt cca tat att att gga aca atg aat cag gga gtt tct tea tea cca 5524 
Cys Pro Tyr lie lie Gly Thr Met Asn Gin Gly Val Ser Ser Ser Pro 
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aat eta gag ctg cat cac aat ttc agg tgg gta tta tgt gca aat cat 5572 
Asn Leu Glu Leu His His Asn Phe Arg Trp Val Leu Cys Ala Asn His 

aca gaa cca gtg aaa ggc ttt tta ggc aga tat ctt cga aga aaa etc 5620 
Thr Glu Pro Val Lys Gly Phe Leu Gly Arg Tyr Leu Arg Arg Lys Leu 



ata gag ata gaa att gaa agg aac att cgc aat aat gac eta gtc aaa 
lie Glu lie Glu lie Glu Arg Asn lie Arg Asn Asn Asp Leu Val Lys 



cct tgc ccc atg gat gta gaa ggt tct aga gta tgg ttc atg gat etc 
Pro Cys Pro Met Asp Val Glu Gly Ser Arg Val Trp Phe Met Asp Leu 

tgg aac tat tct tta gta cct tat att ctg gag gca gtg aga gag ggt 
Trp Asn Tyr Ser Leu Val Pro Tyr lie Leu Glu Ala Val Arg Glu Gly 



ctt cag atg tat ggg aaa cgc aca cca tgg gaa gat cct tea aag tgg 
Leu Gin Met Tyr Gly Lys Arg Thr Pro Trp Glu Asp Pro Ser Lys Trp 



cca gee tta ctt cag ctg cga cca gaa gat gtt ggg tat gaa age tgc 
Pro Ala Leu Leu Gin Leu Arg Pro Glu Asp Val Gly Tyr Glu Ser Cys 



aca gaa gga gat ccc ctg atg aat atg eta atg aaa etc caa gaa gca 
Thr Glu Gly Asp Pro Leu Met Asn Met Leu Met Lys Leu Gin Glu Ala 

gee aat tac teg age aca caa age tgc gac age gaa age ace agt cac 
Ala Asn Tyr Ser Ser Thr Gin Ser Cys Asp Ser Glu Ser Thr Ser His 



5668 



att ata gat tgg att ccg aag acg tgg cat cat etc aac agt ttt ttg 5716 
lie lie Asp Trp lie Pro Lys Thr Trp His His Leu Asn Ser Phe Leu 

gaa aca cac agt tct tct gac gtt ace att ggt- ccc cga eta ttc ctt 5764 
Glu Thr His Ser Ser Ser Asp Val Thr lie Gly Pro Arg Leu Phe Leu 



5812 



5860 



5908 



gtg ctt gac aca tat cca tgg age tea gca act ctg cct cag gag age 5956 
Val Leu Asp Thr Tyr Pro Trp Ser Ser Ala Thr Leu Pro Gin Glu Ser 



6004 



aca tec act aag gaa gec aca ace tea aag cac att cca caa act gac 6052 
Thr Ser Thr Lys Glu Ala Thr Thr Ser Lys His He Pro Gin Thr Asp 



6100 



6148 



cat gaa gac att ttg gat tea tct ctt gaa tct ace etc tagagggtga 6197 
His Glu Asp lie Leu Asp Ser Ser Leu Glu Ser Thr Leu 

aaaaagttaa gggaaaagac tttgetttta aaaaaatgtt tcaaaagaaa ggtattttca 6257 
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ctaaaccact 


gccagtataa 


aagcaccctg 


tcaagggccc 


tgacccagag 


ttgtggtctc 


6317 


caaggaggca 


gcagaactaa 


gtctgaaccg 


ccaagatgct 


aaattgcaat 


ggaagcttaa 


6377 


ctttagttta 


tttctaaaca 


ttttttatat 


ctgtggagta 


atagaaagct 


ccattactca 


6437 


actggaaagg 


accctaatga 


cagggcaact 


gaacagattg 


cacatgggat 


agccaaactg 


6497 


gactttcttt 


gtttcctctt 


taaaagttta 


caatgcagac 


cattttttgt 


cccttccttt 


6557 


tgtttcctct 


gaggggctgt 


tcgccccagg 


cagggtccat 


ctttctgatc 


tgtccaacct 


6617 


cctttgtgcc 


acacggtgct 


ggtcacaggg 


cttcagtagt 


gtttgtgttg 


tgcgctcacc 


6677 


ccattccaga 


acaaatccaa 


gaggccagtc 


ctccataagc 


acaaatggaa 


ttgtgcaacc 


6737 


accagaaaaa 


cactactgtg 


gcaaactgga 


gaagtgccaa 


tttaattcta 


actgccacgt 


6797 


tctcatgatg 


tgctccacca 


actttttagt 


atatgagtca 


ctggttttat 


aaggttgttt 


6857 


ttaccacagt 


ggtcttttta 


aaccacctgc 


ccactccctt 


aacaagagtt 


ttataccaat 


6917 


tattagtcaa 


cactgataaa 


aggctttttt 


agggctttat 


ttgtttgagc 


cttttcagtg 


6977 


aaagaaggaa 


catttcctat 


ggtgctgtct 


cactgcctta 


aaacagattt 


ctatgacagt 


7037 


ttaacagttg 


gtttaaatcc 


taaaccattg 


gtaatttcca 


ctgtcttttc 


atttacaacc 


7097 


aagcaacacc 


agttaacata 


gtagcctcat 


ctctatatat 


ctttctcttt 


tttttttttt 


7157 


tgaagaaatg 


gataggagaa 


agatcagtat 


ttttagcctt 


gtgaatagat 


cgctttgcct 


7217 


atcctccaaa 


atattaaaat 


aacccagaaa 


tgctctttga 


ccgtcactta 


aaacctaaga 


7277 


catgtggcga 


aattccatcc 


agttctaagt 


gaaagagttt 


cagaaggcag 


gagattttga 


7337 


attattatcc 


agcagggctg 


gaagcactag 


atgcagcatg 


agcacaacta 


ttcggctttc 


7397 


cttccctatt 


gtttttgttt 


ttttaatgag 


ttttgacgca 


tgttgttttg 


attgctattg 


7457 


ttgtacatga 


gaaattcagc 


attaaagaac 


actgaagcgg 


taaggtcact 


gtggaagagg 


7517 


aagcgtttat 


actgtaaaag 


aaggttagat 


ttgcacagtc 


tactgggtag 


gtattgtaaa 


7577 


taataatttt 


taaaacttgc 


acaaatcaaa 


acaaacacaa 


acaaaattgt 


attttatcct 


7637 


gttggtgtta 


agaggtgttt 


cacttgctga 


gatttcctgt 


acattgcaaa 


caaatacaga 


7697 


atgcaaaccc 


tcaaagctgt 


attatctggt 


gtgtttgtcc 


tgtatttaca 


gttgtttttg 


7757 


actatgcagg 


agctatcagt 


gctagagtga 


gcatgcttca 


aaactgtaca 


tgaagccaat 


7817 


atatttttgg 


ataagtaaaa 


ctgtctgaaa 


gtacatctgt 


catggcaggc 


tttaaagaga 


7877 


gtgcatgaaa 


actgatcagt 


cattggagaa 


gttaccacca 


cacacaaagg 


acaggtttta 


7937 
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agtttatgaa 


acccaagggc 


taggccatgg 


tatagacttc 


ttctatgagt 


gtgtgaaaat 


7997 


gtgttacttt 


taggacgtgt 


atttggtgct 


actctctgtg 


accaccaatg 


ggtcagttgc 


8057 


tatagaacaa 


caacaccacg 


aaacatctgt 


gcagttttca 


gagtgtcaca 


aagtcaatag 


8117 


gtccttacac 


ggtgctattg 


ccctaaggga 


aatccgaact 


gaatttatgc 


acatagaatt . 


8177 


gtcaccctga 


ctttgaagcc 


tcaaacatgg 


atcaaatctg 


ttgtgaaaca 


tcaatatatg 


8237 


tagctggatg 


agtgactagt 


ttcccttgta 


taatatgtga 


tctaagaaaa 


ttgctaatct 


8297 


ttccctgcca 


ttttgagaaa 


cacagtccaa 


acatgagcat 


aaacagaatt 


tcctgcaata 


8357 


catcccagta 


ggtccaccta 


gtttacaact 


taaactagtt 


tgtgaaacat 


ttgtctgtat 


8417 
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tt tgatgt aa 


catatcatgt 


aaataggcag 


aaacagtgaa 


8477 


ataaatcatc 


tgaaaagttt 


tgtagtcttt 


gtaaagcccc 


aacaataagt 


acttggtgtc 


8537 


aatggactta 


actggatgat 


gtattttcta 


ttggtttatt 


gttcctctag 


cttgtaaacc 


8597 


agcttgcata 


tatttttttg 


caaatgtgca 


ccctgtatct 


gtctaaatta 


ttactttgcc 


8657 


attaaagtgg 


aattatttat 


tgacaaaaaa 


aaa 






8690 



<210> 12 
<211> 1895 
<212> PRT 

<213> Homo sapiens 
<400> 12 

Met Asp Leu Ser Ser Glu Met Asn Arg His Gly Lys Asn Pro Val Ser 
15 10 15 

His Lys Leu Glu Asp Gin Lys Lys lie Tyr Thr Asp Trp Ala Asn His 
20 25 30 

Tyr Leu Ala Lys Ser Gly His Lys Arg Leu lie Lys Asp Leu Gin Gin 
35 40 45 

Asp He Ala Asp Gly Val Leu Leu Ala Glu He He Gin He He Ala 
50 55 60 

Asn Glu Lys Val Glu Asp He Asn Gly Cys Pro Arg Ser Gin Ser Gin 
65 70 75 80 

Met He Glu Asn Val Asp Val Cys Leu Ser Phe Leu Ala Ala Arg Gly 
85 90 95 

Val Asn Val Gin Gly ^Leu Ser Ala Glu Glu He Arg Asn Gly Asn Leu 
100 ' 105 HO 

Lys Ala He Leu Gly Leu Phe Phe Ser Leu Ser Arg Tyr Lys Gin Gin 
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115 120 125 

Gin His His Gin Gin Gin Tyr Tyr Gin Ser Leu Val Glu Leu Gin Gin 
130 135 140 

Arg Val Thr His Ala Ser Pro Pro Ser Glu Ala Ser Gin Ala Lys Thr 
145 150 155 160 

Gin Gin Asp Met Gin Ser Arg Leu Pro Gly Pro Ser Arg Val Pro Ala 
165 170 175 

Ala Gly Ser Ser Ser Lys Val Gin Gly Ala Ser Asn Leu Asn Arg Arg 
180 185 190 

Ser Gin Ser Phe Asn Ser lie Asp Lys Asn Lys Pro Pro Asn Tyr Ala 
195 200 205 

Asn Gly Asn Glu Lys Gly Glu Asp Pro Glu Thr Arg Arg Met Arg Thr 
210 215 220 

Val Lys Asn lie Ala Asp Leu Arg Gin Asn Leu Glu Glu Thr Met Ser 
225 230 235 240 

Ser Leu Arg Gly Thr Gin lie Ser His Ser Thr Leu Glu Thr Thr Phe 
245 250 255 

Asp Ser Thr Val Thr Thr Glu Val Asn Gly Arg Thr lie Pro Asn Leu 
260 265 270 

Thr Ser Arg Pro Thr Pro Met Thr Trp Arg Leu Gly Gin Ala Cys Pro 
275 280 285 

Arg Leu Gin Ala Gly Asp Ala Pro Ser Leu Gly Ala Gly Tyr Pro Arg 
290 295 300 

Ser Gly Thr Ser Arg Phe lie His Thr Asp Pro Ser Arg Phe Met Tyr 
305 310 315 320 

Thr Thr Pro Leu Arg Arg Ala Ala Val Ser Arg Leu Gly Asn Met Ser 
325 330 335 

Gin lie Asp Met Ser Glu Lys Ala Ser Ser Asp Leu Asp Met Ser Ser 
340 345 350 

Glu Val Asp Val Gly Gly Tyr Met Ser Asp Gly Asp lie Leu Gly Lys 
355 360 365 

Ser Leu Arg Thr Asp Asp lie Asn Ser Gly Tyr Met Thr Asp Gly Gly 
370 375 380 

Leu Asn Leu Tyr Thr Arg Ser Leu Asn Arg lie Pro Asp Thr Ala Thr 
385 390 395 400 

Ser Arg Asp lie lie Gin Arg Gly Val His Asp Val Thr Val Asp Ala 
405 410 415 

Asp Ser Trp Asp Asp Ser Ser Ser Val Ser Ser Gly Leu Ser Asp Thr 
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420 425 430 

Leu Asp Asn lie Ser Thr Asp Asp Leu Asn Thr Thr Ser Ser Val Ser 
435 440 445 



Ser Tyr Ser Asn lie Thr Val Pro Ser Arg Lys Asn Thr Gin Leu Arg 
450 455 460 

Thr Asp Ser Glu Lys Arg Ser Thr Thr Asp Glu Thr Trp Asp Ser Pro 
465 470 475 480 

Glu Glu Leu Lys Lys Pro Glu Glu Asp Phe Asp Ser His Gly Asp Ala 
485 490 495 

Gly Gly Lys Trp Lys Thr Val Ser Ser Gly Leu Pro Glu Asp Pro Glu 
500 505 510 

Lys Ala Gly Gin Lys Ala Ser Leu Ser Val Ser Gin Thr Gly Ser Trp 
515 520 525 

Arg Arg Gly Met Ser Ala Gin Gly Gly Ala Pro Ser Arg Gin Lys Ala 
530 535 540 

Gly Thr Ser Ala Leu Lys Thr Pro Gly Lys Thr Asp Asp Ala Lys Ala 
545 550 555 560 

Ser Glu Lys Gly Lys Ala Pro Leu Lys Gly Ser Ser Leu Gin Arg Ser 
565 570 575 

Pro Ser Asp Ala Gly Lys Ser Ser Gly Asp Glu Gly Lys Lys Pro Pro 
580 585 590 

Ser Gly lie Gly Arg Ser Thr Ala Thr Ser Ser Phe Gly Phe Lys Lys 
595 600 605 

Pro Ser Gly Val Gly Ser Ser Ala Met He Thr Ser Ser Gly Ala Thr 
610 615 620 

He Thr Ser Gly Ser Ala Thr Leu Gly Lys He Pro Lys Ser Ala Ala 
625 630 635 640 

He Gly Gly Lys Ser Asn Ala Gly Arg Lys Thr Ser Leu Asp Gly Ser 
645 650 655 

Gin Asn Gin Asp Asp Val Val Leu His Val Ser Ser Lys Thr Thr Leu 
660 665 670 

Gin Tyr Arg Ser Leu Pro Arg Pro Ser Lys Ser Ser Thr Ser Gly He 
675 680 685 

Pro Gly Arg Gly Gly His Arg Ser Ser Thr Ser Ser He Asp Ser Asn 
690 695 700 

Val Ser Ser Lys Ser Ala Gly Ala Thr Thr Ser Lys Leu Arg Glu Pro 
705 710 715 720 
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Thr Lys lie Gly Ser 
725 

Asp Lys Glu Lys Glu 

740 

Leu Ser Gly Ser Pro 
755 

Ala Gin Gly Leu Arg 
770 

Pro Thr Phe Arg Arg 
785 



Gly Arg Ser Ser Pro Val 
730 

Lys Val Ala Val Ser Asp 
745 

Lys Ser Ser Pro Thr Ser 
760 

Gin Pro Gly Ser Lys Tyr 
775 

Leu Phe Gly Ala Lys Ala 
790 795 



Thr Val Asn Gin Thr 
735 

Ser Glu Ser Val Ser 
750 

Ala Ser Ala Cys Gly 
765 

Pro Asp lie Ala Ser 
780 

Gly Gly Lys Ser Ala 
800 



Ser Ala Pro Asn Thr Glu Gly Val Lys Ser Ser Ser Val Met Pro Ser 
805 810 815 

Pro Ser Thr Thr Leu Ala Arg Gin Gly Ser Leu Glu Ser Pro Ser Ser 
820 825 830 

Gly Thr Gly Ser Met Gly Ser Ala Gly Gly Leu Ser Gly Ser Ser Ser 
835 840 845 

Pro Leu Phe Asn Lys Pro Ser Asp Leu Thr Thr Asp Val lie Ser Leu 
850 855 860 

Ser His Ser Leu Ala Ser Ser Pro Ala Ser Val His Ser Phe Thr Ser 
865 870 875 880 

Gly Gly Leu Val Trp Ala Ala Asn Met Ser Ser Ser Ser Ala Gly Ser 
885 890 895 

Lys Asp Thr Pro Ser Tyr Gin Ser Met Thr Ser Leu His Thr Ser Ser 
900 905 910 

Glu Ser lie Asp Leu Pro Leu Ser His His Gly Ser Leu Ser Gly Leu 
915 920 925 

Thr Thr Gly Thr His Glu Val Gin Ser Leu Leu Met Arg Thr Gly Ser 
930 935 940 

Val Arg Ser Thr Leu Ser Glu Ser Met Gin Leu Asp Arg Asn Thr Leu 
945 950 955 960 

Pro Lys Lys Gly Leu Arg Tyr Thr Pro Ser Ser Arg Gin Ala Asn Gin 
965 970 975 

Glu Glu Gly Lys Glu Trp Leu Arg Ser His Ser Thr Gly Gly Leu Gin 
980 985 990 

Asp Thr Gly Asn Gin Ser Pro Leu Val Ser Pro Ser Ala Met Ser Ser 
995 1000 1005 

Ser Ala Ala Gly Lys Tyr His Phe Ser Asn Leu Val Ser Pro Thr Asn 
1010 1015 1020 
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Leu Ser Gin Phe Asn Leu Pro Gly Pro Ser Met Met Arg Ser Asn Ser 
1025 1030 1035 1040 

lie Pro Ala Gin Asp Ser Ser Phe Asp Leu Tyr Asp Asp Ser Gin Leu 
1045 1050 1055 

Cys Gly Ser Ala Thr Ser Leu Glu Glu Arg Pro Arg Ala lie Ser His 
1060 1065 1070 

Ser Gly Ser Phe Arg Asp Ser Met Glu Glu Val His Gly Ser Ser Leu 
1075 1080 1085 

Ser Leu Val Ser Ser Thr Ser Ser Leu Tyr Ser Thr Ala Glu Glu Lys 
1090 1095 1100 

Ala His Ser Glu Gin lie His Lys Leu Arg Arg Glu Leu Val Ala Ser 
1105 1110 1115 1120 

Gin Glu Lys Val Ala Thr Leu Thr Ser Gin Leu Ser Ala Asn Ala His 
1125 1130 1135 

Leu Val Ala Ala Phe Glu Lys Ser Leu Gly Asn Met Thr Gly Arg Leu 
1140 1145 1150 



Gin Ser Leu Thr Met Thr Ala Glu Gin Lys Glu Ser Glu Leu lie Glu 
1155 1160 1165 

Leu Arg Glu Thr lie Glu Met Leu Lys Ala Gin Asn Ser Ala Ala Gin 
1170 1175 1180 

Ala Ala lie Gin Gly Ala Leu Asn Gly Pro Asp His Pro Pro Lys Asp 
1185 1190 1195 1200 

Leu Arg lie Arg Arg Gin His Ser Ser Glu Ser Val Ser Ser lie Asn 
1205 1210 1215 

Ser Ala Thr Ser His Ser Ser lie Gly Ser Gly Asn Asp Ala Asp Ser 
1220 1225 1230 

Lys Lys Lys Lys Lys Lys Asn Trp Leu Arg Ser Ser Phe Lys Gin Ala 
1235 1240 1245 

Phe Gly Lys Lys Lys Ser Thr Lys Pro Pro Ser Ser His Ser Asp lie 
1250 1255 1260 

Glu Glu Leu Thr Asp Ser Ser Leu Pro Ala Ser Pro Lys Leu Pro His 
1265 1270 1275 1280 

Asn Ala Gly Asp Cys Gly Ser Ala Ser Met Lys Pro Ser Gin Ser Ala 
1285 1290 1295 

Ser Ala lie Cys Glu Cys Thr Glu Ala Glu Ala Glu lie lie Leu Gin 
1300 1305 13i0 

Leu Lys Ser Glu Leu Arg Glu Lys Glu Leu Lys Leu Thr Asp lie Arg 
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1315 1320 1325 

Leu Glu Ala Leu Ser Ser Ala His His Leu Asp Gin lie Arg Glu Ala 
1330 1335 1340 

Met Asn Arg Met Gin Asn Glu lie Glu lie Leu Lys Ala Glu Asn Asp 
1345 1350 1355 1360 

Arg Leu Lys Ala Glu Thr Gly Asn Thr Ala Lys Pro Thr Arg Pro Pro 
1365 1370 1375 

Ser Glu Ser Ser Ser Ser Thr Ser Ser Ser Ser Ser Arg Gin Ser Leu 
1380 1385 1390 

Gly Leu Ser Leu Asn Asn Leu Asn lie Thr Glu Ala Val Ser Ser Asp 
1395 1400 1405 

lie Leu Leu Asp Asp Ala Gly Asp Ala Thr Gly His Lys Asp Gly Arg 
1410 1415 1420 

Ser Val Lys lie lie Val Ser lie Ser Lys Gly Tyr Gly Arg Ala Lys 
1425 1430 1435 1440 

Asp Gin Lys Ser Gin Ala Tyr Leu lie Gly Ser lie Gly Val Ser Gly 
1445 1450 1455 

Lys Thr Lys Trp Asp Val Leu Asp Gly Val lie Arg Arg Leu Phe Lys 
1460 1465 1470 

Glu Tyr Val Phe Arg lie Asp Thr Ser Thr Ser Leu Gly Leu Ser Ser 
1475 1480 1485 

Asp Cys lie Ala Ser Tyr Cys lie Gly Asp Leu lie Arg Ser His Asn 
1490 1495 1500 



Leu Glu Val Pro Glu Leu Leu Pro Cys Gly Tyr Leu Val Gly Asp Asn 
1505 1510 1515 1520 

Asn lie lie Thr Val Asn Leu Lys Gly Val Glu Glu Asn Ser Leu Asp 
1525 1530 1535 

Ser Phe Val Phe Asp Thr Leu lie Pro Lys Pro lie Thr Gin Arg Tyr 
1540 i 1545 1550 



Phe Asn Leu Leu Met Glu His His Arg lie lie Leu Ser Gly Pro Ser 
1555 1560 1565 

Gly Thr Gly Lys Thr Tyr Leu Ala Asn Lys Leu Ala Glu Tyr Val lie 
1570 1575 1580 

Thr Lys Ser Gly Arg Lys Lys Thr Glu Asp Ala lie Ala Thr Phe Asn 
1585 1590 1595 1600 

Val Asp His Lys Ser Ser Lys Glu Leu Gin Gin Tyr Leu Ala Asn Leu 
1605 1610 1615 
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Ala Glu Gin Cys Ser Ala Asp Asn Asn Gly Val Glu Leu Pro Val Val 
1620 1625 1630 

lie lie Leu Asp Asn Leu His His Val Gly Ser Leu Ser Asp lie Phe 
1635 1640 1645 

Asn Gly Phe Leu Asn Cys Lys Tyr Asn Lys Cys Pro Tyr lie lie Gly 
1650 1655 1660 

Thr Met Asn Gin Gly Val Ser Ser Ser Pro Asn Leu Glu Leu His His 
1665 1670 1675 1680 

Asn Phe Arg Trp Val Leu Cys Ala Asn His Thr Glu Pro Val Lys Gly 
1685 1690 1695 

Phe Leu Gly Arg Tyr Leu Arg Arg Lys Leu lie Glu lie Glu lie Glu 
1700 1705 1710 

Arg Asn lie Arg Asn Asn Asp Leu Val Lys lie lie Asp Trp lie Pro 
1715 1720 1725 

Lys Thr Trp His His Leu Asn Ser Phe Leu Glu Thr His Ser Ser Ser 
1730 1735 1740 

Asp Val Thr lie Gly Pro Arg Leu Phe Leu Pro Cys Pro Met Asp Val 
1745 1750 1755 1760 

Glu Gly Ser Arg Val Trp Phe Met Asp Leu Trp Asn Tyr Ser Leu Val 
1765 1770 1775 

Pro Tyr lie Leu Glu Ala Val Arg Glu Gly Leu Gin Met Tyr Gly Lys 
1780 1785 1790 

Arg Thr Pro Trp Glu Asp Pro Ser Lys Trp Val Leu Asp Thr Tyr Pro 
1795 1800 1805 

Trp Ser Ser Ala Thr Leu Pro Gin Glu Ser Pro Ala Leu Leu Gin Leu 
1810 1815 1820 

Arg Pro Glu Asp Val Gly Tyr Glu Ser Cys Thr Ser Thr Lys Glu Ala 
1825 1830 1835 1840 

Thr Thr Ser Lys His lie Pro Gin Thr Asp Thr Glu Gly Asp Pro Leu 
1845 1850 1855 

Met Asn Met Leu Met Lys Leu Gin Glu Ala Ala Asn Tyr Ser Ser Thr 
1860 1865 1870 

Gin Ser Cys Asp Ser Glu Ser Thr Ser His His Glu Asp lie Leu Asp 
1875 1880 1885 

Ser Ser Leu Glu Ser Thr Leu 
1890 1895 



<210> 13 
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<211> 3682 

<212> DNA 

<213> Homo sapiens 

<400> 13 



gaattcctgg 


tggagaacag 


cacatgtaca 


gatggggtga 


gaacagcata 


cgtacaggta 


60 


ggggtaagct 


ggtgctatat 


gagaaagcat 


ggaataagtt 


attaagtttg 


acctgcttgg 


120 


gaactgaggg 


gcaggtgtga 


gggatgaagc 


aggagtaggt 


aggggctaga 


tcacaaaaga 


180 


tctatgccag 


tgtttctcac 


agtgtgattc 


ccagcccagt 


agcatgatat 


cacttgggat 


240 


cttgttagaa 


atacaaattc 


ttatacatca 


ccctggacta 


gaccacctga 


ataagaaaag 


300 


ttgggcatga 


ggcctacaaa 


tttttaaaaa 


agtcatacag 


gtgattgcaa 


tgcatgctaa 


360 


agtttgagaa 


acactctttg 


ctgtggtttg 


aatatttgtg 


tccttccaaa 


attcatgtag 


420 


aaaccatctc 


caatgttata 


gtattaagag 


gagggaccct 


tgggagctga 


tcagatcatg 


480 


aagtctcctt 


tcttataaag 


gggattaaaa 


gccttggccc 


ttttaccctt 


tgtccatgta 


540 


aggacacagt 


gttggaagca 


gggactgggt 


tctcaccaga 


aacagaacct 


gccagcctct 


600 


tggtcttgga 


cttctcagcc 


tccacaattg 


tgagaaataa 


gtttctgttg 


tttataagtt 


660 


aaccagtctc 


aggtattttg 


taatggcagc 


acaaaggggc 


taagaaactg 


ttctatgccc 


720 


taacaagaaa 


tgtggtcact 


ttcctgaagg 


aaatggggat 


atatataaag 


atgttatata 


780 


agactcgtaa 


tatttatttg 


gaaggcttgc 


tctgcaagca 


aggtggaaga 


gcaacatgaa 


840 


ggaagcgtgg 


tggaggtgag 


aggactggag 


gttaagttgg 


tagggagata 


caggaaagaa 


900 


gcttatgaca 


cttgagttaa 


aatgtagcat 


ccttcctatg 


tgtagggctc 


ataaaaatgt 


960 


atagtctaag 


atagaacaca 


gaatactcta 


tgaatcctgc 


ccacaaggtg 


ttggtaatct 


1020 


agattcactt 


tttttttctg 


ataatgccat 


ccatatgtat 


ggagcgtcta 


ctactgtatg 


1080 


ccagagtgac 


tctggaatcg 


gtttggttga 


tctagacaag 


accataagga 


gagtcccctt 


1140 


actacctctt 


ctccagggga 


gggattcaag 


ttgaactagt 


acttcagaga 


ctgtttagta 


1200 


atatcatgca 


tgaaaggtga 


tggttaggac 


agaaaaataa 


atggattgca 


tcataattcc 


1260 


tcaggttctc 


caaatatgtg 


gtggtctcaa 


accatgtgaa 


ttggtctgca 


catcctgttt 


1320 


gggttgcgtg 


tcagcagttg 


agatctgagc 


cttatttgta 


acagtgaaac 


agtgagagac 


1380 


ctgcccttca 


agagctgttt 


ttcagctagg 


aatagaaaag 


ggccaggcta 


gactcctctt 


1440 


tctgctggat 


cttgcttctt 


ctcagcaata 


gaagtagacc 


tgccttccta 


gctgtagaga 


1500 
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aaaggtgccg 


gtaggcgggc 


aggtgagcct 


gtggataatc 


ctggagtaaa 


ggttcaatag 


1560 


accttcaagt 


ctatcctaca 


ggattcggag 


tgaggggaga 


gaaaaggaga 


cgcttctctg 


1620 


gctgagagag 


gaagagaaaa 


aaaaatccca 


gatatctgac 


agctatatct 


tcccatcacc 


1680 


accttcctct 


aaacccatgc 


ctctctgttt 


agtaggacat 


aaaatgaaga 


gtgacccacc 


1740 


ccccaccccc 


agcccatccc 


ccgtttgtag 


gtgtgctttc 


aatgaaaata 


agtcggtgtt 


1800 


catggacgga 


aactagagca 


gctgaaaata 


gatgcaagac 


ttgttgagca 


tacaaatcat 


1860 


ttccccctta 


gtctccaagg 


gaggaaaaaa 


aatccctctt 


actctccttg 


cagcctgtgt 


1920 


tctgcattct 


ggagaggaag 


ctgaggctgg 


tcctcaggcg 


ctcctcccgc 


cgttcccgca 


1980 


ggaaactttt 


ctcgcagggc 


ccgctccgtc 


catcccgcgc 


ggttccaaga 


cggtgggcct 


2040 


cccgtgggct 


cctctcctgg 


gcaagggccc 


agaccccgcg 


acgcgcctgt 


ctctttaaat 


2100 


tccagctgcg 


cggctgggaa 


acagcgccac 


tcgccgccca 


ggccggctgg 


aggctgaaga 


2160 


gcgagctcgc 


gctttcgctc 


ccggctgcgc 


gccgcggaga 


gctgggctcg 


gcccgcgggc 


2220 


tgctaggtgg 


cggcggcgcg 


gggcggggag 


gcgcggcccg 


gcggaggagg 


gaagaaagag 


2280 


cgagccgggc 


cgggagaggc 


gccgcgcccg 


gtcccgcgcc 


cggtcccgca 


cccgctctca 


2340 


gcggcccaag 


cagtttcttt 


ctgggtgaca 


agaatgtgcc 


tcggttggtt 


tttctttttt 


2400 


ttctccatct 


ccttaagacg 


atttccatag 


taacctgatc 


aagtggctca 


aaatcgcaaa 


2460 


cctgaggatt 


tccgcggccc 


gccggcaaga 


cctcggccag 


gtaacgctgc 


gatctcctcc 


2520 


tcttccattg 


caaaccgctg 


cgctccttgc 


aaagttcctt 


ttgtggaaaa 


tcgcccagcc 


2580 


caagggagcc 


cggggtattt 


gcaacagcgt 


gttcatttcc 


aggtgcctgt 


cacgggtctc 


2640 


ctccctgctg 


cttctccagg 


acccatgatg 


agattatttt 


taaaaattgt 


ttttggtcgt 


2700 


ctcccccgcc 


ccctcccctt 


ctttattttt 


ttcctcttcg 


ctgcactctt 


ctcggctttt 


2760 


cccctgacac 


tactgatggg 


ggtgcggggg 


gacgtcgggg 


atgggggtgg 


ccagcgcggt 


2820 


cctgggagtg 


gcgggttcgg 


atgggctggc 


tgcggtgggc 


cactttgggc 


atctcggcgt 


2880 


ggcctgcgcc 


ggggtcacgg 


ggagggctgt 


cagcgccagg 


gcggcggaac 


ccgaggtctc 


2940 


cagacgagtg 


agggagggat 




yyy u y ciL 'yy a 


gcgc t tggc t 


acr t crcr c t aa t 


3000 


gagcgtccat 


acatcatagc 


tctccttccc 


actcccccgc 


ccctcttcgg 


gattctctct 


3060 


ttctctttcc 


ccgtcctcat 


ttctttcttc 


ctttactcac 


cactcgcttc 


attctcttcc 


3120 


ttccatttcc 


tctttttttc 


tcccctcatt 


tccttttttt 


cctttccctt 


ttaaagaaag 


3180 


gggaatcgtt 


tgtaaccctt 


tcgttctacc 


aacgtggaat 


agctgtgaaa 


cctgcagcgt 


3240 
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ggtcacctca gcctggtcgt tttcagaccc gtcctcatcc atcaacatat ttgtttcccg 3300 

agtctattga tctccctgaa ttctacagaa atgcattcta agctaggcgc ctgtatgtca 3360 

gaatcagttc tgcaggtagc ttccgtgctc caagtatgac atgtattgta agggctgcat 342 0 

ctgttttaaa cccacataag ccatgggtat aaataaatgt agctttgaaa aaaaatctgg 3480 

ccttattcta gataaacttc cctcttaaat tactgatata ctcttctccc tctttgacat 3540 

ttaattttag gaaagttggg agacaggttc ttgtcctcca gtttttaagg agcaggcaac 3600 

ttctattatc ttaattttct cgtctttgaa catcactcac gtttgcacta cccagtcagt 3660 

ggaacgagtg ggtcataatt aa 3 682 



<210> 14 

<211> 3904 

<212> DNA 

<2 13 > Homo sapiens 

<400> 14 

cctgcattat tgtttttatc tgacttccaa ttttggtgtt ccctgggtgg gtgggttttc 60 
ctgacacatt tacaagatgc ttttggcagg ttggctggaa tttgaaggca catttaattg 120 
taggtgcaat aaaatattca ttttctcttg ttcttggttt gagatgtcat gcccttttgg 180 
tcacttatat tttggtgtga ctgtgtgtgt gtgtgtatgt gtttgtgtga aggatttaac 240 
aaagtctgtt ctaactgtca tgtgatttga agttaaaagg tatgttagtg acaagccaca 300 
aatttctctt atttatagta cattgatcct gaaaccattt tttcccttgt gatttcttct 
gtgcatggat catttaacga aaggttggca atgatgagct atttttttat aataggaaaa 
aaattcctca agtttactta ccaagtcata tttttataca gagggattag caaatatttc 



360 
420 
480 



tgatctaata ttttaataga ctgaattgct gaccactgct aattaccaag aatatatttt 540 
cttaattctg aaattgctgt acctctcaag ttgtctggag gactccaagt gacccaactt 600 
gtaactcatg gcaacaggaa gtggttgttc tgggtgcaag ctgaagtgtg cacatggacc 660 
cgtactttgt tagcactcgg ggacttgata tggaaagaat taatgtactg gcttttttgt 720 
atagatgaat gttaactttc tgacattagt cagaactaca tctcccaagc cttgttttgc 
agtgtctgtc cctttgctct tcacttacag taagtcctta cttaactgac ttgataggtt 
cttggaaact gcaactttaa gcaaaaggaa gtataatgaa acacttttat cacaggctaa 
ttggtagaaa caagacttaa gttcccatgg catatttctg gtcacaaaaa catttccaaa 

34 



780 
840 
900 
960 
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cttctcaaaa cacttcaata ttaagcattc aaatacatgt aaactatgta tatatgtaag 1020 
aaaggttact ataaaccaga tcaatattta cccaattatt taagttcagg gtcttaggtg 1080 
gctggagcct atccgagtag ctcagggcac aaggcgggaa ccagccctag acaggacacc 1140 
atcctgttgc agggcacgtt cacacatgcc cacacgcagg ctgggaccat ttacatgtgc 1200 
caattcacct accatgcaca tctttgagac gtggcaggaa gcaagagtac ctggagaaaa 1260 
tccatacaga tatggggaga atgtacaaac tccacccaga cagtggaccc agccaggaat 1320 
caacatttgg gcaacattat aatgaaacga agttgaatga aatgatgtcg ttccacgacc 
tgctgtactt gaggggtgtt ataaaattct cagaagacag aggtttaatg ctatcttttt 
aatagaaaat aacttataga gaagtgtgca catgtgactt tgtgtgtagc aggaatcatt 
aggatgagaa tcagacgtaa gaggtggtgc caacatgagg aatgttgaga ttcagggagc 
tgtggatgga agtagaagcc agaaggccag ggttaggttc ctacttctta ctgtttcagt 
tattgcagtg ttggcctgtt tattcacaga tgtcacctag ctttgttttc tcaagaagaa 
aaatgagcat aatctttcct gttatgaatt cttaaacaca caggacataa ccacagacac 
agaggtgcac atatgtagca gtaatggata ctaaatgata cactcggagg aaacagaaaa 1800 
gacttctgaa tagagactgg agatacttcc ttggaccatt gatgaatggg caatgatgca 
tttttgtctt ccattcagaa ggctaatata ttgctctcta tgttctatgg ataaaggcag 
tatatgctca aggatgaatc acataatatg cataataaat ccagcaagca ttaccctttt 
acttatgtga ctgcaagtag gaatacattt cccccactct aaccatgtaa gatttctttc 
ccttctccca ttttgtaagc aaaagtaagt tcctgaaagg ttaaatggac ctcaggatgg 2100 
gaaaaatccc cagagctatc tttctgcaca gacttcattt tttctcccaa gtctgactgt 
caactgcgat atctgatatg aggctctggt gctgatgttt ccataggtca tcatccttcg 
gtgtcccaga tgaagtctca ggtcgaacat tgcaatagca cagattctga atttaatgca 
tcattaaagt tggttatgta acccaatggc cttgttaaac tccagatttt taaaattata 2340 
tgtatttact attctcttat tttagaatga tctcacaatg ttcacaagaa ataagcccag 2400 
tccctgcaaa gactttaaaa gctgcttgtt cacatcatta gattgtacaa cgcttgtaca 
atgacacttt ttgctaatct atgcaacatt tttgtaacaa ttgtgcacat tttaactact 
tcagataatc aggacctaga gacttcaaga tctggaagca ttgctggtga catagagcaa 
aaactttctt gagaatagga agtcagtgtt ttgacaagtg atttataaca gttcaggtat 2640 



1380 
1440 
1500 
1560 
1620 
1680 
1740 



1860 
1920 
1980 
2040 



2160 
2220 
2280 



2460 
2520 
2580 
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agccaggaag 


gtttgaaaca 


aaccttaagt 


attatttctt 


tcatcttgat 


tagtatatat 


2700 


ttatatgtga 


tctatttatg 


tatattaata 


gatttttggg 


tcttatagcc 


agctttcatt 


2760 


tttctctatt 


ggaaaagatc 


taagtcccca 


tccttccttg 


gtggcttttg 


gtaggtttgt 


2820 


agacaaaaca 


ttgaagaatc 


aatggtacct 


tttatacatt 


aatactgcca 


atatgaccat 


2880 


aaaatcatat 


tttttgggaa 


tttattcccc 


cgatcaaaag 


aagcatttgt 


tattgaacac 


2940 


agtcttatgc 


taccttatta 


agatgtatca 


aacaccctga 


ttgatcaaaa 


acacctcagt 


3000 


ccattttaag 


gcagtattgc 


ccagcaatta 


aagatgtagc 


ttctggagga 


gtctttctga 


3060 


gtttgaattc 


agtactcttc 


cacgtactat 


ataggtgatc 


ttgggtaaac 


ttcttgagtc 


3120 


tcagtatccc 


catctgtaaa 


attgttgtag 


agaagaattt 


ttgtgatgat 


taggtgagag 


3180 


aatatattaa 


tgtaatattt 


aggagagcaa 


ccagcatgta 


gcatatattc 


attacatatc 


3240 


aatttctata 


ttattgatgt 


tcatactgct 


gatgttgaaa 


tgcacaggaa 


ggccacagtt 


3300 


attttctgtt 


tagattgatt 


tttcttttaa 


agtctgaaca 


taaactgtaa 


tactgtgctt 


3360 


atttatgtag 


gaactgtgat 


ctcgtctcct 


ccttttccca 


tctccccctc 


tctaccttag 


3420 


tttttcctta 


tagtctcaag 


ctgaaaacaa 


tgaccaggtg 


cctaagagat 


aagaatactc 


3480 


tttcttttga 


actcatggca 


ttagcagtga 


cctggatgag 


attggaggct 


attattctaa 


3540 


gtgaaatagc 


tcaggaatgg 


aaaaccaagc 


attgtatgtt 


cttacttata 


agtgggagct 


3600 


aagctatgag 


gatacaaagg 


cataagaatg 


acacaacaga 


ctttggagac 


ttggggaaag 


3660 


ggtgggaagg 


gggtgaggga 


taaaagacta 


caaatagggt 


gcagtgtata 


ctgcttgggt 


3720 


ggtgggtgca 


ccaaaatctc 


acaaatcacc 


accaaagaac 


ttactcatgt 


aaccaaacac 


3780 


cacctgttcc 


ccagtaacct 


atggatataa 


aaaaattaaa 


aaaaagaaaa 


aaagaaaact 


3840 


cttttttgca 


gggggcaggt 


aaagggtaag 


agggcatccc 


atttttgagt 


ttctagaaaa 


3900 


gctt 












3904 


<210> 15 
<211> 2119 

^ _L £t -** J-' tx 

<213> Homo 


sapiens 












<400> 15 














ctgcaggaag 


cagcagcaag 


gtccagggag 


cctctaattt 


aaataggaga 


agtcagagct 


60 


ttaacagcat 


tgacaaaaac 


aagcctccaa 


attatgcaaa 


tggaaacgaa 


aaaggtaagt 


120 
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gtttgttaca 


tcattatgac 


acaagtccaa 


catgagtctt 


gtgaattgca 


tgctaaatct 


180 


aatatttgag 


cagcgtaaca 


actttgggcc 


tagagatgtt 


atcagtggag 


tttctttatg 


240 


tttcctaact 


gtcccctcct 


gactgccagc 


tttcttatct 


gaagaacatt 


ttaaacaaat 


300 


aaactcattc 


attttaaagt 


agttagttat 


atatgcaagt 


acaaatactg 


tttctcaaaa 


360 


acaggtcctt 


ccaaatgcat 


gtaaatcaca 


ttttcttatg 


tctttttatg 


tttttgaaaa 


420 


tgtatcctga 


aatcataaag 


ccatattgaa 


tttatctgaa 


tccttaactt 


cagttaaggt 


480 


aagagccata 


agtgtttttg 


acaattaagg 


ttggagcatc 


aaaatttgaa 


acataattac 


540 


agtaggtttt 


tatctttgca 


agcagcagat 


cccagagata 


ttatgacctc 


agttttcccc 


600 


aaaagacaaa 


ttattcatat 


ttgttttgtt 


ttcttgaatt 


agtgcataat 


ataaatatca 


660 


aatcacaaaa 


tcaaggacat 


taaatgaaag 


tgtctgttaa 


aggcatatta 


taaatgaatc 


720 


ataagccaca 


cagttctctg 


tgatgtacga 


agtgggcatt 


taaagaggtg 


ctgatttgat 


780 


gcttgtcact 


gagtagcaga 


gaggacgggg 


atgagtatgt 


gtagtttaca 


cctcaatcat 


840 


gaggaagtga 


agaacttgtg 


ctgttataag 


tagtatggct 


gtgtgaggaa 


ctagggtgtt 


900 


ctgctggatt 


ttgaggaagt 


attttcaaat 


caatagaact 


tcaaactttt 


cttcagagtg 


960 


ttgggctcta 


catggaaaaa 


cacatgaaat 


taaaaagtgg 


cacaaatgtt 


tagttagtag 


1020 


aacatctggc 


taattgggat 


caaataattc 


aaccatgtgg 


gaacgttttt 


gctcaaaata 


1080 


gataattgtg 


aattgtttca 


tataggcaaa 


tgattagaca 


acttcctctt 


cctcaaatgt 


1140 


gaacggacag 


atgtgatcta 


gaagcaagac 


actcttttgt 


gtaaatattc 


cctttggcct 


1200 


aaagcaaaag 


tggacagact 


ttaaacacct 


gagagcagag 


cagtgtgtgt 


taagattgca 


1260 


atatcttaag 


ctcttgagtt 


aaatggaaaa 


tgaaaaacaa 


aagtgtatat 


ttggaagtta 


1320 


ggaatgtttt 


ctttaaaata 


taaaataaaa 


ttttagattt 


aagatcacaa 


gaaatattac 


1380 


tgaagactta 


tactcttcct 


ggggctaagg 


gaggtgacag 


tcgctcatca 


gaaaaaaaaa 


1440 


aatgccctca 


tttcctaact 


tttctaaaaa 


atataataca 


agttcaggct 


aatacttcct 


1500 


gtatatgtgg 


gaaatttcta 


ggggaagcta 


acaggcttag 


aaataaagat 


gtgttaaata 


1560 


gactaccaaa 


gtgtccaatt 


aagcaacacg 
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1620 


attactagca 


atgtttgtaa 


atagacttag 


aaatgcattt 


gatgaattaa 


cacttttata 


1680 


tcttaattta 


tctgaatttt 


tctgtaatgt 


gaaaatgttt 


tatttaactt 


atttctggca 


1740 


tctattagta 


aaattctgat 


gatatacaag 


cattaatatt 


tttccatggc 


cactcaattc 


1800 


atacatacct 


tccctatcta 


tgcttagaag 


gcagtgcaaa 


attagatagt 


agcaatattg 


1860 
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attataacca caaggtggag acagatgtca tgtaatatgc agtctgctca tataaagcac 1920 
attttcttag acaagagttt tcatacgata taataaagac atctggaatt tgtcttgtat 1980 
gcaatatgaa atttgctatt aaacgtggag ttaaaacttt atgtcaatag atccaataac 
aatgttcata aattaatcat tatgtcatgc tgtatttcca aaatactatc ttaaattata 
agagcaaacg aggtaataa 

<210> 16 

<211> 2103 

<212> DNA 

<213> Homo sapiens 



2040 
2100 
2119 



60 



300 
360 



<400> 16 

gtacattttt taataaagat gtttgtttta actttttgaa tatgaagatt tctagttcta 
gaataatgtt tataaaaata tacaaatcca tctggtgatg agttgacctc tatcacaact 120 
agtttgcata tataacttgg gtgtgaccaa gcaaggtgag agttaagaac ttttaaaact 180 
tactgtatta tattgataga actcagaaag tactaacttg aatattatta ttctaattgc 240 
ttttcccttt tagttattaa aaataagaat acttaaatta ataacaagat cttttactgg 
caggattaac caaattatct gtaatgtgtt cctcgaatgc ttttaagtgg aaatatactt 
tatacattct ttaacaactc tgagaggatg agttacataa atcagttcag gaatctatag 420 
aatctgtaat acatagtaaa ggtttattca caattaaaac aatttcactt ctatattaaa 480 
aaaacaaatt gttgaaagta cagtggcttt tcatatgtat gatttgtaaa acaaattagc 540 
ttttttaaag tgatgtgacg cttaatgaga agaaatcagt agagaattac aaactgcact 600 
tcaaaagata catctaatat cattttaata atgaaatttg aaaaaatagt gtgctcgttt 660 
tacagtctca ttaaatgaat taaaatatca gcacacattg tagtaggtta tcattggcag 720 
agaaggctga aatagaaacg ttacaatggg atgcactgcc atctgaacat tatgtcgaag 780 
tggaacgcgg aaacatattt ctcagaacaa gtggtaaaat gaaaacagca tcatttgtaa 
agcatttctt ttgagagtgc ttcagtttct tctcctgatg acctgccatt cagaaactga 
caatgaataa tacactctga caccagcatt tgtcaatttg cccagaacca tatgagagta 
ctctagacag atatatgttc cgaagtaaac cgaatacctg ttaactgtaa atcaaatctt 
gtagaaacca tgccatggtt cctttggaca tatactttgc atgcctgaag caagttacct 1080 
taagaaatca ttcttttgtt ttacaaaact tgtattaaaa aattaaaaat gcaaaaaagc 1140 



840 
900 
960 
1020 
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ttaatattat 


taggaattta 


tccatagctt 


tatttggaat 


ccagtttctt 


tattatgatc 


1200 


tataaacatg 


catcatttga 


tggagttcct 


tagtggagag 


gtgtttttcc 


atgttgctaa 


1260 


gaaacatgcc 


ccagcaccag 


aagggatact 


acctaccatc 


tttttgccat 


ttctcaccgt 


1320 


gattcttaca 


ttgtacctgt 


ttactcactg 


aacagggctt 


ccttctcttt 


gtctagattc 


1380 


taatcaggtg 


tcttctggtg 


tggaagcttt 


ggcttttatt 


tacacacaac 


acagaattaa 


1440 


taagatagat 


gccaaggatt 


tagcaacatt 


ttaattcaac 


attatacagg 


tatcagagtt 


1500 


aatgagaatt 


atgcattagt 


ctttaaattt 


gggcagctta 


ttcagctaaa 


acatagatgt 


1560 


ctagctctta 


aacactttgt 


ttttttaatt 


actctgaaat 


tacaataaag 


tcaaagaact 


1620 


gaactgtttt 


cttttcaagc 


cagtgcaaat 


gtgctttagt 


tattatttta 


ctggtgatct 


1680 


aattatgcat 


tttaatgctt 


tattacttaa 


tacttatata 


agcctaaaat 


acgttgttaa 


1740 


tgtcataatt 


tcagggattt 


tagtattctt 


tccatgagtt 


accataacta 


ggtgcatatg 


1800 


tgtaaatata 


cgtatatatc 


tatatctata 


tatttatatc 


tatgtatata 


tcaatttata 


1860 


agactaaata 


gacttggcca 


tatgtgttgt 


tggtttatgc 


atacatgcac 


aaatattgag 


1920 


gtgtccacaa 


agtatatatg 


cctgtacata 


aattacatac 


tggctggtga 


gtgaatgtaa 


1980 


gcttctctaa 


attgtacaac 


tctccacaga 


gtggcactct 


aatattgcaa 


aggtacaata 


2040 


taagcatgtg 


cagaatgaac 


agctcttcta 


ggatccctat 


aaaactccac 


cccatgtttc 


2100 


tgt 












2103 


<210> 17 
<211> 4260 
<212> DNA 
<213> Homo 


sapiens 












<400> 17 














aagcttcatc 


ccagaggggc 


acttgccaga 


tgcctgctag 


agctctcctg 


tatgaggagt 


60 


ctatcaacac 


ctgctgggag 


gtgtctcctc 


gtcaggaggc 


acgggggtca 


gggacccact 


120 


tgaggaggct 


gtctgtccct 


tagcggagct 


agaacac tgt 


get cggagat 


ccgctgctct 


180 


cttcagagct 


ggcaggcaag 


agtgttttag 


tctgctgagc 


ctgcgcccac 


agccgcccct 


240 


tcccccaggt 


gctctgtccc 


agggagatga 


gagttttatc 


tgtaagcccc 


tgactggggc 


300 


tgctaccttt 


ctttcagata 


tgccccgccc 


agagaggagg 


aatctagaga 


ggcagtctgg 


360 


ctacagcagc 


tttgccaagc 


tgcagtgggc 


tctgcccagt 


ccaaaattcc 


cagcgggttt 


420 
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gtttacattg 


tgaggggaaa 


agcacctact 


caagcctcag 


ttatggcagt 


tgcccctccc 


48 0 


cccaccaagc 


tccagggtcc 


caggtgtcct 


tcagactgct 


gtgctggcaa 


tgagaatttc 


54 0 


aagccagtgg 


atcttagctt 


gctgggctcc 


acaggggtgg 


gatccactga 


gctagaccac 


6 0 0 


ttagctccct 


ggcttcagcc 


ccctttccag 


gtgagtggat 


ggttctgtct 


cactggcatt 


66 0 


ccaggtgcta 


ctggggtatg 


aaaaaaaaaa 


ctcctgcagc 


tagcttggtg 


tctgcccagt 


72 0 


tttgtgcttg 


aaactcaggc 


ccttggtggt 


gtggacaccc 


aatggaatct 


cctggtgtgc 


7 8 0 


atgttgtgaa 


gactgtggga 


aaagcatagt 


atctgggctg 


gatagctccg 


tccttcaagg 


84 0 


cacagtccct 


catgacttcc 


cttggctagg 


ggagggagtt 


ccccaaccct 


ttgcacttcc 


900 


caggtgaggc 


aacaccccac 


cctgcttctg 


ctcaccctct 


gtgggctgca 


cccactgtct 


960 


aatcagtcac 


tgtgagatga 


gcctggtacc 


tcagttggaa 


atgcagaaat 


cacctgcctt 


102 0 


ctgtgttgat 


ctcactggga 


gcagcagact 


ggagctgttc 


ctattcagcc 


atctttctca 


10 8 0 


ggtcataatc 


atagattttt 


aattgatccc 


agcaacatgg 


attagtaaac 


agcatatttc 


114 0 


caagtgattt 


ttttttattt 


taaggtcaaa 


tctacaaaat 


attatagtgt 


tatcaccact 


12 00 


taaaattatt 


actggtgata 


ctatgtttgt 


ctctattcac 


attttattgc 


tagaaagaat 




tataatttgt 


agataataat 


agttatttga 


aatgtattac 


atatcctttt 


acttttaaga 


13 2 0 


agaggtgact 


taattatcta 


ggtatacaat 


tattttgagg 


atactaaatg 


tcatgaatag 


13 8 0 


caaatttatc 


atattgcttt 


cctaggtgaa 


gaccctgaaa 


caagaagaat 


gagaacagtt 


144 0 


aaaaacatag 


cagacttgag 


gcagaattta 


gaagagacta 


tgtccagtct 


tcgtgggact 


1500 


cagataagcc 


acaggttttt 


ttcaattttg 


catatatttg 


agccaataaa 


gaaaaaataa 


1560 


ttacaaacaa 


acatttaact 


tttcttataa 


tgacagagat 


gggatttcag 


tttcccctta 




ctattttctc 


ccttgtttta 


tatcaaattg 


attggtaatt 


atccttaaac 


tgagaattca 


1680 


cagtatatac 


ctatttatct 


tttatctcta 


tctctatctg 


ctatttatgt 


ctttttcagt 


174 0 


ataatttcca 


gtactgcaac 


taccaccatc 


actgttaagt 


ggatttgtaa 


tacctgtcct 


18 0 0 


agaaaacagt 


ggcacaagtt 


gcacttgaaa 


tgcatctggg 


cagggtagta 


gggagacatt 


1860 


caaacataat 


tgtagttaac 


tttcagaata 


ggtctgggaa 


ggttacagtg 


agttaaggat 


1920 


ttgttgaaaa 


tgtaaaacaa 


tatgttgttt 


tacccaaggt 


gtactgatgg 


cctttctttt 


1980 


gaaaacaaac 


gaaaagctat 


aaaatgtatg 


cccctttcca 


caatttgacc 


tcaaaatgaa 


2040 


tatagagttt 


agctttcggg 


aagatgacgt 


gtttataaga 


gatgaccctc 


aactccagcc 


2100 
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ttttctgtct tcatgcattc tagattatgg ccctaagtga accagagtat agttatttct 2160 

ccattttatt tgacagcacc ctggagacaa catttgacag cactgtgaca acagaagtta 2220 

atggaaggac catacccaac ttgacaagtc gacccacccc catgacctgg aggttgggcc 2280 

aggcatgtcc gcgacttcag gcgggagatg ctccctccct gggtgctggc tatcctcgca 2340 

gtggtaccag tcgattcatc cacacagacc cctcgaggtt catgtatacc acgcctctcc 2400 

gtcgagctgc tgtctctagg ctgggaaaca tgtcacagat tgacatgagt gagaaagcaa 2460 

gcagtgacct ggacatgtct tctgaggtcg atgtgggtgg atatatgagt gatggtgata 2520 

tccttgggaa aagtctcagg actgatgaca tcaacagtgg gtaagtaacc ctgttctccg 2580 

tcagcattgt gtgaagaggg gaggtggtct actataatgc attcactata aacaaatgtg 2640 

taagtttgcc cagaaagtca tgagaacata tgagatatct gaggttattc agagtgttga 2700 

agggcccttc ctctgctcat tcatggagag taaagaatcc aagatttcta taaattcatt 2760 

ataagccgct aagtttttct gttgttgaga gaaacacatg tggcttctgt ttttcagagt 2 820 

gattttcaca tgcttcttaa gtaacagatt ttgtagttaa ggacgtggga aggagacagg 2880 

aggagttttg ctgatttgct tgattttttt tttctttttt agcttgttag aagcggcctg 2940 

taactgcttt gagaaacaaa tattttctta ctgtcttcaa ttatgcatcc ccaatttaac 3000 

ttgagggaaa aatcactttg gagttgaaag tttcactcta ttcattttct tttgatggta 3060 

tcagatttca atacatctca gaccctgttt ttcttctgtg tcctattaca ttccaaaaca 3120 

tgttgtgatt gtaaaactct tagagtatat taacaatttg ggatatttgg cataatcaga 3180 

gaataggtcc aaaaggaggc aataggatat tctattaata attgtaattg ccatttttag 3240 

catttcctgt tatgtactat gctcttgtca agtgctttga agatagtgtt ttacttttcc 3300 

ttcccaccac cagcaatgtt tatgaggtag atgtttttat acatgttcta tggataagga 3360 

aactgagtct aattggcccc ggctgggaac taacgctagg gaaacggcag acctgcatta 3420 

gaactcagct atgtctgact tcaaacacag gctcagtaat atgtggaaaa gcttcccaat 3480 

taactttgtc tataaacttt gtgtgagtct ggattttgac ttactctttg tctttacgca 3540 

tctgagagga cccatgtagg aaataattct tctatataag tgacccttcc tgacttcatt 3600 

catgaaaagc ttatgtttga agggtgacac gacctaaaaa agagtacaaa atagcttttg 3 660 

attacattta tagctttgct ctgatatcct aatacctact agtccattcc tggtatccac 3720 

cctacctgac tttctaaaaa tttagaatta tagagactaa ttatgattaa ttaagatagg 3780 

ttgttgttca gttgccactg gattcagagt gcctagtttg aatctctccc attcactatc 3840 
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tgtggacccc ttcggaacct aacgtatcca aattagtttt tgtcatctag aataaggata 3900 

aaattgtacc atcttcatga agttgttagg atcatccaca aattttagtt tgcgcaatgc 3 960 

ttggcatgat acaagcactc aataaattta tcatcttcct ctttatcatc actattacat 4020 

ttattatcat taataaccat accaattttt ggttgttgtt agttataatt atcatttttg 4080 

tatgtattta acatagccta ggaggcaatg cccagttcag aaaacataat ggcaaagcaa 4140 

gagtgtctaa ggcacactct ttctcccatc tctctcttct ttcttctcca ttctttccac 4200 

tctatcccct cttctctttt ttttctcaat ctccttagat gtggacatat gtgtgaattc 4260 



<210> 18 

<211> 2429 

<212> DNA 

<213> Homo sapiens 

<400> 18 



tgtgggtgtg ggtgtgaagc atgtgtatgt gtgtgtgtga agcatctccc cacctgtaat 60 

gtaagtccat gagtgcagaa tttttgacat attctttacg tgttgagttt taacaaatgt 120 

ttgtggagtg aatgaacaaa ttaatgaata taggctattt attaattagg caatatagtc 180 

acataggctg gcaatcgcat ctaattaaat agagtggtaa atgagttcca gaaagaacta 240 

aggtactaca aggatgttat gaaagagaaa aatgagttat gtgaaaaata ggagacagtg 3 00 

ataagaggga aagaatccca aagtgtgggc cacattttga aactaatgac ctattattct 360 

attattgtta gctgaaagta gaaaacgtca tgggagggaa tatctgctag tttttggtaa 420 

aggatgttgt gatggcagaa ccaagaaatg aacacaaggt gactttggtt tggggacagt 480 

gggataatca actctccttg ctccatcagg gccccagact gggctctggc agaggaactc 540 

agaacaacgt aaagacctag ataggtatct aataaattgg gacctgtgaa aacagtgcct 600 

cttaaagtgt ggtacctgga ccagcagcag cagcagcagc agccattgaa acttcataga 660 

aagacagatt ctcagcttca tccaagactt actgaattag aatatctcaa ggtaaggcct 720 

ggtaatctga gctttaacta gccctcaagg tgattcttaa gttcaagcat cactatatta 780 

agttgaacaa atagatgcca ggcctataaa tacatgtaac gcctagcata aatatttcaa 840 

cattaaaaat gacatttcat agttcttatt taccctatta gctgtgttct gtcaagataa 900 

tgagaatatt gatatgttag aatacactga tgcactaatt tttaaattag atcaaataat 96 0 

gacttgttat acctgaaata aattggttca gcttggtaga tgcagttttt gagaattata 1020 
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taagtcattt 


ttaaaagaat 


aattttaact 


tgagctgctt 


gcataaatta 


aattgcaaaa 


10 8 0 


aggtcatagt 


ataaatcctc 


ctattagcag 


agatagaagg 


tttttaaaaa 


aat tacagat 




aagtctgaag 


gtcttttaaa 


atcttatatt 


caggaagtga 


ctcgggatgt 


_ +_ .4— 4-4— 4- 

atatcatttt 


noon 


aaaatacatg 


gtcttaaatg 


ttgtagttgt 


atgactcttt 


cagttaattt 


aaaat ac tt c 


i *? & n 


cttctatgaa 


aaattgtttc 


aaaaattttt 


ctaaattctg 


ttatccattt 


caagtaggat 


t *3 o n 


aggcaagaac 


agatataaga 


tactactttt 


ttgttcatgt 


ttactaaaaa 


aaa aat tact 




gtaattgaga 


tcatgtaaaa 


acatgtttcc 


tgtctatttg 


tcttaacctt 


ttaatcctgg 


t a a n 


caccttaaat 


ttgacatagt 


aggaattaga 


agacaattgc 


agaaaatgtc 


aactggggaa 


i c: n n 


attttattct 


actaaaaact 


atgtccatac 


aacatagcaa 


atcacatttt 


aaaggccaaa 




aagtctttca 


tagcaatttt 


tcagattatt 


ttcaaagcat 


atcttctctc 


tgctcctgca 


lbzU 


gcatgccgtt 


gatttttctg 


ttatgcagtc 


acataagtaa 


ttacatgttt 


acatgtctat 


loou 


ttcactcata 


gaacacgaaa 


cagttaaatg 


tagaataata 


tccaatccat 


ctttttatca 


t n a n 

1 / 4U 


ccagtagcta 


gcatactgta 


ggaactcaat 


aaatatatca 


gataaattgt 


ggaaataacc 


i onfl 


atatcagctt 


ataacatata 


gaaatgtgag 


tttaaaaaga 


aaacaattat 


acat atgaaa 


i orn 

lOOU 


aaatttttat 


accatttttt 


taaagacctt 


tcagatgtca 


tacagtttgg 


ac t t t tccag 


J. I? ^ VJ 


tgtttcttgt 


atcatgagac 


aatagtagac 


attgtaaatc 


aaaaatagtt 


t t c tggggt t 




gtgtacattt 


gaaaaaactg 


aatatcatat 


ctgttcttag 


agagtaatga 


tggatat t aa 


on/in 


catatcaaag 


gtacagagaa 


gtcttaaagt 


tcaaagtaac 


atctgcttaa 


ttgtat ttaa 


ziUU 


ttcagtgctc 


catgagcttt 


tttatcactg 


attccctccc 


ttttttctct 


tatgataata 


0 i £ n 

1 1DU 


attaacttgt 


tcctgtagca 


ttttaagaaa 


tgttgattta 


gttgaatgcc 


ttcacttctc 


2220 


caatataata 


gcagaaactc 


agaaatattt 


atttacccag 


aatcatgcag 


ctaatagtac 


2280 


aaggattcag 


gtcttttact 


tcctattttg 


tggttcccaa 


ctacttttgc 


caaaggtctt 


2340 


ttaaataata 


tgaaacatat 


tagtgattga 


ttcattatag 


taaatgggta 


aatgataagg 


2400 


cttgcaataa 


ttcactgaca 


agaaagctt 








2429 



<210> 19 
<211> 2926 
<212> DNA 
<213> mouse 

<220> 
<221> CDS 
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<222> (2) . . . (2926) 
<400> 19 

a age cac age acc ctg gag aca acc ttt gat acg act gtg aca act 46 
Ser His Ser Thr Leu Glu Thr Thr Phe Asp Thr Thr Val Thr Thr 

gaa gtg aat gga agg gec ate ccc aac ctg aca age cga cct tec ccc 94 

Glu Val Asn Gly Arg Ala lie Pro Asn Leu Thr Ser Arg Pro Ser Pro 

atg acc tgg aga ctg ggt caa gcg tgc cct cgt eta cag get gga gat 142 

Met Thr Trp Arg Leu Gly Gin Ala Cys Pro Arg Leu Gin Ala Gly Asp 

gec ccc tec atg ggc get gga tat tct cga age ggt acc age cga ttc 190 

Ala Pro Ser Met Gly Ala Gly Tyr Ser Arg Ser Gly Thr Ser Arg Phe 

ate cac acg gat ccc tec agg ttt atg tat acc acg cct etc cgc cga 238 

lie His Thr Asp Pro Ser Arg Phe Met Tyr Thr Thr Pro Leu Arg Arg 

get get gtc teg cgt ctg gga aac atg tea caa at a gat atg age gag 2 86 

Ala Ala Val Ser Arg Leu Gly Asn Met Ser Gin lie Asp Met Ser Glu 

aaa gca age agt gac ctg gat gtg tct tct gaa gtg gat gtt ggt gga 334 

Lys Ala Ser Ser Asp Leu Asp Val Ser Ser Glu Val Asp Val Gly Gly 

tac atg age gat ggt gat ate ctt ggg aag agt ctg aga gcg gat gat 3 82 

Tyr Met Ser Asp Gly Asp lie Leu Gly Lys Ser Leu Arg Ala Asp Asp 

ate aac agt ggg tac atg aca gat ggt ggg etc aac eta tat acc aga 43 0 

lie Asn Ser Gly Tyr Met Thr Asp Gly Gly Leu Asn Leu Tyr Thr Arg 

agt ctt aac cga gtc ccg gac aca gca act tec aga gat gtc ata cag 478 

Ser Leu Asn Arg Val Pro Asp Thr Ala Thr Ser Arg Asp Val lie Gin 

a 9 a 99 c 9tt cac gat gtg aca gtg gac gca gac age tgg gat gac age 526 

Arg Gly Val His Asp Val Thr Val Asp Ala Asp Ser Trp Asp Asp Ser 

agt tct gtg age agt ggc etc agt gac aca ctt gat aac att age aca 574 

Ser Ser Val Ser Ser Gly Leu Ser Asp Thr Leu Asp Asn lie Ser Thr 

gat gac etc aac acc acg tec tec ate agt tct tac tec aac ate act 622 

Asp Asp Leu Asn Thr Thr Ser Ser lie Ser Ser Tyr Ser Asn lie Thr 
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gtc ccc tec agg aag aac act cag ctg aaa aca gat gcg gag aaa cgt 670 
Val Pro Ser Arg Lys Asn Thr Gin Leu Lys Thr Asp Ala Glu Lys Arg 

teg aca aca gat gag acc tgg gat agt cct gag gag ctg aag aaa gec 718 
Ser Thr Thr Asp Glu Thr Trp Asp Ser Pro Glu Glu Leu Lys Lys Ala 

gag gga gat tgt gac age cat ggt gac gga gec gec aag tgg aag ggt 766 
Glu Gly Asp Cys Asp Ser His Gly Asp Gly Ala Ala Lys Trp Lys Gly 

get act tct gga ctt get gaa gac teg gag aag aca ggg cag aaa gec 814 
Ala Thr Ser Gly Leu Ala Glu Asp Ser Glu Lys Thr Gly Gin Lys Ala 

age ctg tct gtg tct cag aca ggc tec tgg agg aga ggc atg tct gec 862 
Ser Leu Ser Val Ser Gin Thr Gly Ser Trp Arg Arg Gly Met Ser Ala 

cag gga gga act cca get aca get agg cag aaa acc age aca agt gca 910 
Gin Gly Gly Thr Pro Ala Thr Ala Arg Gin Lys Thr Ser Thr Ser Ala 

etc aag acc cct ggg aag aca gat gat gee aaa get tec gag aaa ggg 958 
Leu Lys Thr Pro Gly Lys Thr Asp Asp Ala Lys Ala Ser Glu Lys Gly 

aaa act cct etc aaa gga tea tec ttg caa agg tct cct tea gat gca 1006 
Lys Thr Pro Leu Lys Gly Ser Ser Leu Gin Arg Ser Pro Ser Asp Ala 

ggg aaa age age ggg gat gaa ggg aaa aag cca ccg tea ggc att gga 1054 
Gly Lys Ser Ser Gly Asp Glu Gly Lys Lys Pro Pro Ser Gly lie Gly 

aga teg aca gee age agt tct ttt gga tac aag aag cca agt ggt gta 1102 
Arg Ser Thr Ala Ser Ser Ser Phe Gly Tyr Lys Lys Pro Ser Gly Val 

999 get tec act atg att acc age age ggt gee acc ate aca age ggt 1150 
Gly Ala Ser Thr Met lie Thr Ser Ser Gly Ala Thr lie Thr Ser Gly 

tea get aca ctg ggg aaa ate ccc aaa tec get gec att ggt ggg aag 1198 
Ser Ala Thr Leu Gly Lys lie Pro Lys Ser Ala Ala lie Gly Gly Lys 

tec aat gca gga agg aaa acc age ctg gac ggg tec cag aat caa gat 124 6 

Ser Asn Ala Gly Arg Lys Thr Ser Leu Asp Gly Ser Gin Asn Gin Asp 

gat gtt gtc ctg cac gtg age teg aag acc acc etc cag tac cgt agt 12 94 

Asp Val Val Leu His Val Ser Ser Lys Thr Thr Leu Gin Tyr Arg Ser 

ttg ccc cgc cct tct aag tec age acc age gga ate cct ggg aga ggt 1342 
Leu Pro Arg Pro Ser Lys Ser Ser Thr Ser Gly lie Pro Gly Arg Gly 
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ggc cac agg teg age acc age age att gat tec aat gtc age age aag 
Gly His Arg Ser Ser Thr Ser Ser lie Asp Ser Asn Val Ser Ser Lys 



tea get ggg gee acc acc tec aaa ctg aga gaa ccg act aag ate ggc 
Ser Ala Gly Ala Thr Thr Ser Lys Leu Arg Glu Pro Thr Lys lie Gly 



tea ggg cgc teg agt cca gtc act gtc aac caa aca gac aaa gag aag 
Ser Gly Arg Ser Ser Pro Val Thr Val Asn Gin Thr Asp Lys Glu Lys 



gag aaa gta gca gtg tea gat tea gag age gtt tec ttg tea ggt tec 
Glu Lys Val Ala Val Ser Asp Ser Glu Ser Val Ser Leu Ser Gly Ser 



ccc aaa tec age ccc acc tct gee agt gee tgt ggg act caa ggg etc 
Pro Lys Ser Ser Pro Thr Ser Ala Ser Ala Cys Gly Thr Gin Gly Leu 



aga cag cca ggg tec aaa tat cca gat att gec teg ccc aca ttt cga 
Arg Gin Pro Gly Ser Lys Tyr Pro Asp lie Ala Ser Pro Thr Phe Arg 



agg ttg ttc ggt gee aag gca ggc ggc aaa tct gee tec gca cct aat 
Arg Leu Phe Gly Ala Lys Ala Gly Gly Lys Ser Ala Ser Ala Pro Asn 



act gag ggg gcg aag tec tec tea gta gtg etc age cct agt acc tct 
Thr Glu Gly Ala Lys Ser Ser Ser Val Val Leu Ser Pro Ser Thr Ser 



tta gee cga caa ggc agt ctg gag tea ccg teg tec ggt acg gga age 
Leu Ala Arg Gin Gly Ser Leu Glu Ser Pro Ser Ser Gly Thr Gly Ser 



atg ggc agt get ggt ggg ctg agt ggc age age age cct etc ttc aat 
Met Gly Ser Ala Gly Gly Leu Ser Gly Ser Ser Ser Pro Leu Phe Asn 



aaa ccc tea gac eta act aca gat gtt ata age tta agt cac tec ttg 
Lys Pro Ser Asp Leu Thr Thr Asp Val lie Ser Leu Ser His Ser Leu 



get tec age cca gcg teg gtt cac tct ttc aca tec ggt ggg ctt gtg 
Ala Ser Ser Pro Ala Ser Val His Ser Phe Thr Ser Gly Gly Leu Val 



tgg 9 ct 9 CC aat ct 9 a 9 c a 9 fc tcc tct 9 CC 99C age aag gac act cca 
Trp Ala Ala Asn Leu Ser Ser Ser Ser Ala Gly Ser Lys Asp Thr Pro 



agt tac cag tcc atg act agt etc cat acg age tct gag tee att gac 
Ser Tyr Gin Ser Met Thr Ser Leu His Thr Ser Ser Glu Ser lie Asp 



1390 



1438 



1486 



1534 



1582 



1630 



1678 



1726 



1774 



1822 



1870 



1918 



1966 



2014 
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ctg ccc etc age cat cat ggc tec ctg 
Leu Pro Leu Ser His His Gly Ser Leu 



cac gag gtg cag age ctg etc atg aga 
His Glu Val Gin Ser Leu Leu Met Arg 



etc tea gaa aga tac ace cca tea tct 
Leu Ser Glu Arg Tyr Thr Pro Ser Ser 



ggc aaa gag tgg ctg cga teg cat tec 
Gly Lys Glu Trp Leu Arg Ser His Ser 



ggc aac cag tct ccc ttg gtc tec cct 
Gly Asn Gin Ser Pro Leu Val Ser Pro 



ace gga aaa tat cac ttt tec aac ttg 
Thr Gly Lys Tyr His Phe Ser Asn Leu 



cag ttt aac ctg cct gca ccc agt atg 
Gin Phe Asn Leu Pro Ala Pro Ser Met 



gee cag gac tec tec ttc gac etc tat 
Ala Gin Asp Ser Ser Phe Asp Leu Tyr 



agt gca act tec ctg gag gaa agg cca 
Ser Ala Thr Ser Leu Glu Glu Arg Pro 



tea ttc aga gac age atg gag gaa gtt 
Ser Phe Arg Asp Ser Met Glu Glu Val 



gtc tec age aca tea tec ctt tac tct 
Val Ser Ser Thr Ser Ser Leu Tyr Ser 



tea gag caa ate cat aag eta egg aga 
Ser Glu Gin lie His Lys Leu Arg Arg 



aaa gtc get acc etc acg tct cag ctg 
Lys Val Ala Thr Leu Thr Ser Gin Leu 



gca get ttt gaa aag agt tta ggg aat 
Ala Ala Phe Glu Lys Ser Leu Gly Asn 

eta acc atg aca gcg gaa caa aag gaa 



tct gga ctg acc aca ggc act 2062 
Ser Gly Leu Thr Thr Gly Thr 



acg ggt agt gtg aga tct act 2110 
Thr Gly Ser Val Arg Ser Thr 



egg cag gec aac caa gaa gaa 2158 
Arg Gin Ala Asn Gin Glu Glu 



act ggc ggg ctg cag gat act 2206 
Thr Gly Gly Leu Gin Asp Thr 



tct gee atg tea teg tea gee 2254 
Ser Ala Met Ser Ser Ser Ala 



gtg agt ccc acc aac etc tec 2302 
Val Ser Pro Thr Asn Leu Ser 



atg cgc tec age agt ate ccc 2350 
Met Arg Ser Ser Ser lie Pro 



gat gat gee cag ctt tgc ggt 23 98 

Asp Asp Ala Gin Leu Cys Gly 



c 99 9 CC 9 tt: a 9 c cac tec ggc 2446 
Arg Ala Val Ser His Ser Gly 



cat ggc tct tea ctg tea ttg 2494 
His Gly Ser Ser Leu Ser Leu 



acg get gaa gag aag get cat 2542 
Thr Ala Glu Glu Lys Ala His 



gaa ctg gtt gec tec cag gag 2590 
Glu Leu Val Ala Ser Gin Glu 



tea gca aat get cac ctt gta 2638 
Ser Ala Asn Ala His Leu Val 



atg act ggc cgt ttg caa agt 2686 
Met Thr Gly Arg Leu Gin Ser 

tct gag ctt ate gaa ctg egg 2734 
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Leu Thr Met Thr Ala Glu Gin Lys Glu Ser Glu Leu lie Glu Leu Arg 



gaa acc att gaa atg ttg aag gcc cag aac tct get gec caa gca gec 
Glu Thr lie Glu Met Leu Lys Ala Gin Asn Ser Ala Ala Gin Ala Ala 



att cag gga gca ctg aat ggc cca gac cac cct ccc aaa gat etc cgc 
He Gin Gly Ala Leu Asn Gly Pro Asp His Pro Pro Lys Asp Leu Arg 



acg age cat tec age att ggc agt ggt aat gat get gac tec aag aaa 
Thr Ser His Ser Ser He Gly Ser Gly Asn Asp Ala Asp Ser Lys Lys 



<210> 20 
<211> 975 
<212> PRT 
<213> Mouse 

<400> 20 



Ser His Ser Thr Leu Glu Thr Thr Phe Asp Thr Thr Val Thr Thr Glu 
15 10 15 

Val Asn Gly Arg Ala He Pro Asn Leu Thr Ser Arg Pro Ser Pro Met 
20 25 30 

Thr Trp Arg Leu Gly Gin Ala Cys Pro Arg Leu Gin Ala Gly Asp Ala 
35 40 45 

Pro Ser Met Gly Ala Gly Tyr Ser Arg Ser Gly Thr Ser Arg Phe lie 
50 55 60 

His Thr Asp Pro Ser Arg Phe Met Tyr Thr Thr Pro Leu Arg Arg Ala 
65 70 75 80 

Ala Val Ser Arg Leu Gly Asn Met Ser Gin He Asp Met Ser Glu Lys 
85 90 95 

Ala Ser Ser Asp Leu Asp Val Ser Ser Glu Val Asp Val Gly Gly Tyr 
100 105 110 

Met Ser Asp Gly Asp He Leu Gly Lys Ser Leu Arg Ala Asp Asp He 
115 120 125 

Asn Ser Gly Tyr Met Thr Asp Gly Gly Leu Asn Leu Tyr Thr Arg Ser 
130 135 140 

Leu Asn Arg Val Pro Asp Thr Ala Thr Ser Arg Asp Val He Gin Arg 
145 150 155 160 



2782 



2830 



ate aga aga cag cac tec tct gaa agt gtt tct agt ate aac age gca 2878 
He Arg Arg Gin His Ser Ser Glu Ser Val Ser Ser He Asn Ser Ala 



2926 
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Gly Val His Asp Val Thr Val Asp Ala Asp Ser Trp Asp Asp Ser Ser 
165 170 175 

Ser Val Ser Ser Gly Leu Ser Asp Thr Leu Asp Asn lie Ser Thr Asp 
180 185 190 

Asp Leu Asn Thr Thr Ser Ser lie Ser Ser Tyr Ser Asn lie Thr Val 
195 200 205 

Pro Ser Arg Lys Asn Thr Gin Leu Lys Thr Asp Ala Glu Lys Arg Ser 
210 215 220 

Thr Thr Asp Glu Thr Trp Asp Ser Pro Glu Glu Leu Lys Lys Ala Glu 
225 230 235 240 

Gly Asp Cys Asp Ser His Gly Asp Gly Ala Ala Lys Trp Lys Gly Ala 
245 250 255 

Thr Ser Gly Leu Ala Glu Asp Ser Glu Lys Thr Gly Gin Lys Ala Ser 
260 265 270 

Leu Ser Val Ser Gin Thr Gly Ser Trp Arg Arg Gly Met Ser Ala Gin 
275 280 285 

Gly Gly Thr Pro Ala Thr Ala Arg Gin Lys Thr Ser Thr Ser Ala Leu 
290 295 300 

Lys Thr Pro Gly Lys Thr Asp Asp Ala Lys Ala Ser Glu Lys Gly Lys 
305 310 315 320 

Thr Pro Leu Lys Gly Ser Ser Leu Gin Arg Ser Pro Ser Asp Ala Gly 
325 330 335 

Lys Ser Ser Gly Asp Glu Gly Lys Lys Pro Pro Ser Gly lie Gly Arg 
340 345 350 

Ser Thr Ala Ser Ser Ser Phe Gly Tyr Lys Lys Pro Ser Gly Val Gly 
355 360 365 



Ala Ser Thr Met 
370 

Ala Thr Leu Gly 

385 

Asn Ala Gly Arg 



Val Val Leu His 
420 

Pro Arg Pro Ser 
43 5 

His Arg Ser Ser 



lie Thr Ser Ser 
375 

Lys lie Pro Lys 
390 

Lys Thr Ser Leu 
405 

Val Ser Ser Lys 



Lys Ser Ser Thr 
440 

Thr Ser Ser lie 



Gly Ala Thr lie 
380 

Ser Ala Ala lie 
395 

Asp Gly Ser Gin 
410 

Thr Thr Leu Gin 
425 

Ser Gly lie Pro 



Asp Ser Asn Val 



Thr Ser Gly Ser 



Gly Gly Lys Ser 
400 

Asn Gin Asp Asp 
415 

Tyr Arg Ser Leu 
430 

Gly Arg Gly Gly 
445 

Ser Ser Lys Ser 
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450 455 460 

Ala Gly Ala Thr Thr Ser Lys Leu Arg Glu Pro Thr Lys lie Gly Ser 
465 470 475 480 

Gly Arg Ser Ser Pro Val Thr Val Asn Gin Thr Asp Lys Glu Lys Glu 
485 490 495 

Lys Val Ala Val Ser Asp Ser Glu Ser Val Ser Leu Ser Gly Ser Pro 
500 505 510 

Lys Ser Ser Pro Thr Ser Ala Ser Ala Cys Gly Thr Gin Gly Leu Arg 
515 520 525 

Gin Pro Gly Ser Lys Tyr Pro Asp lie Ala Ser Pro Thr Phe Arg Arg 
530 535 540 

Leu Phe Gly Ala Lys Ala Gly Gly Lys Ser Ala Ser Ala Pro Asn Thr 
545 550 555 560 

Glu Gly Ala Lys Ser Ser Ser Val Val Leu Ser Pro Ser Thr Ser Leu 
565 570 575 

Ala Arg Gin Gly Ser Leu Glu Ser Pro Ser Ser Gly Thr Gly Ser Met 
580 585 590 

Gly Ser Ala Gly Gly Leu Ser Gly Ser Ser Ser Pro Leu Phe Asn Lys 
595 600 605 

Pro Ser Asp Leu Thr Thr Asp Val lie Ser Leu Ser His Ser Leu Ala 
610 615 620 

Ser Ser Pro Ala Ser Val His Ser Phe Thr Ser Gly Gly Leu Val Trp 
625 630 635 640 

Ala Ala Asn Leu Ser Ser Ser Ser Ala Gly Ser Lys Asp Thr Pro Ser 
645 650 655 

Tyr Gin Ser Met Thr Ser Leu His Thr Ser Ser Glu Ser lie Asp Leu 
660 665 670 

Pro Leu Ser His His Gly Ser Leu Ser Gly Leu Thr Thr Gly Thr His 
675 680 685 

Glu Val Gin Ser Leu Leu Met Arg Thr Gly Ser Val Arg Ser Thr Leu 
690 695 700 

Ser Glu Arg Tyr Thr Pro Ser Ser Arg Gin Ala Asn Gin Glu Glu Gly 
705 710 715 720 



Lys Glu Trp Leu Arg Ser His Ser Thr Gly Gly Leu Gin Asp Thr Gly 
725 730 735 

Asn Gin Ser Pro Leu Val Ser Pro Ser Ala Met Ser Ser Ser- Ala Thr 
740 745 750 
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Gly Lys Tyr His Phe Ser Asn Leu Val Ser Pro Thr Asn Leu Ser Gin 
755 760 765 

Phe Asn Leu Pro Ala Pro Ser Met Met Arg Ser Ser Ser lie Pro Ala 
770 775 780 

Gin Asp Ser Ser Phe Asp Leu Tyr Asp Asp Ala Gin Leu Cys Gly Ser 
785 790 795 800 

Ala Thr Ser Leu Glu Glu Arg Pro Arg Ala Val Ser His Ser Gly Ser 
805 810 815 

Phe Arg Asp Ser Met Glu Glu Val His Gly Ser Ser Leu Ser Leu Val 
820 825 830 

Ser Ser Thr Ser Ser Leu Tyr Ser Thr Ala Glu Glu Lys Ala His Ser 
835 840 845 

Glu Gin lie His Lys Leu Arg Arg Glu Leu Val Ala Ser Gin Glu Lys 
850 855 860 

Val Ala Thr Leu Thr Ser Gin Leu Ser Ala Asn Ala His Leu Val Ala 
865 870 875 880 

Ala Phe Glu Lys Ser Leu Gly Asn Met Thr Gly Arg Leu Gin Ser Leu 
885 890 895 

Thr Met Thr Ala Glu Gin Lys Glu Ser Glu Leu lie Glu Leu Arg Glu 
900 905 910 

Thr lie Glu Met Leu Lys Ala Gin Asn Ser Ala Ala Gin Ala Ala lie 
915 920 925 

Gin Gly Ala Leu Asn Gly Pro Asp His Pro Pro Lys Asp Leu Arg lie 
930 935 940 

Arg Arg Gin His Ser Ser Glu Ser Val Ser Ser lie Asn Ser Ala Thr 
945 950 955 J 960 

Ser His Ser Ser lie Gly Ser Gly Asn Asp Ala Asp Ser Lys Lys 
965 970 975 



<210> 21 
<211> 2583 
<212> DNA 
<213> mouse 



<400> 21 

gggatgaagg gaaaaagcca ccgtcaggca ttggaagatc gacagccagc agttcttttg 60 

gatacaagaa gccaagtggt gtaggggctt ccactatgat taccagcagc ggtgccacca 120 

tcacaagcgg ttcagctaca ctggggaaaa tccccaaatc cgctgccatt ggtgggaagt 180 
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ccaatgcagg aaggaaaacc agcctggacg ggtcccagaa tcaagatgat gttgtcctgc 240 

acgtgagctc gaagaccacc ctccagtacc gtagtttgcc ccgcccttct aagtccagca 300 

ccagcggaat ccctgggaga ggtggccaca ggtcgagcac cagcagcatt gattccaatg 360 

tcagcagcaa gtcagctggg gccaccacct ccaaactgag agaaccgact aagatcggct 420 

cagggcgctc gagtccagtc actgtcaacc aaacagacaa agagaaggag aaagtagcag 480 

tgtcagattc agagagcgtt tccttgtcag gttcccccaa atccagcccc acctctgcca 540 

gtgcctgtgg gactcaaggg ctcagacagc cagggtccaa atatccagat attgcctcgc 600 

ccacatttcg aaggtaaggg tatgtaaaga gatgttggga aaacataaaa ggtagtatat 660 

agcatgtatt tattctgtac gaaactattt tcatgtattc taaatattct aagattctgt 720 

atcttatact tgtctaaaat atagtgattt tattttgctg attgcacctg ttgctagtgt 780 

aaaagcattg ctcatttaga gagtggttag cctttcagct atacagccag tgtgacacta 84 0 

aaatacagat accacttgta gcgggcataa aaccacatga ctgactattc atagaaataa 900 

agtgatagct tgtaaagata tttagtgatt tccacctctc ctttccagaa ttaaaaaaag 960 

caaattgcat agatctttat aaacacattt acttctagtg tatgttatct tgttgactct 1020 

taatgaaatg gcagttatga atatagatga tatattcttt ctaacagttt ataagagacc 1080 

aatttataca gtaccagatc ttaacatagt aacaataaca gcaacaaaaa caacccaaaa 1140 

agctatcaaa gtatggtctg attgcagaat ttgaaaacat ttacatgttt gacataggac 1200 

aagaactcag gagtgaggtg actttttata agtcttcatc aatgtccttt tacaggaacc 1260 

aggaagcata tctgatatat gtgtcaggat tatcacttta ttaattatgt gaaattctgt 1320 

ttagaaatct acctgatttt aaatacttta atatagtagg ggtcaaaatt agttaatgag 1380 

ttaagacaag ttgttaaata atcctggctc tgttttctca tcttcaaaat gatagagtat 1440 

aatttatcac ctcttgttaa atatttcagg tttgtgttta ttctcttgat aactttgatc 1500 

tcttagaaga gtcttgaaga atttacatta agtaatctta gaaacataac tatttgagaa 1560 

acagtagtca aattttgtca ttagaagtat taactctgaa gaatgatttg aagtgacagt 1620 

tcttagaaag aataaattat agcttgtagc aagagtaaat attttcactg cttgtgtgag 1680 

agccaagagc gccctcttgt ggcccattac ctatgaaaca atttctcata ttcgccctag 1740 

aaatcttcca ctgcaggaaa taatggattt cattgcctct gaattagtaa ccattctgcc 1800 

atttcttcat accattttat ttccatactt gcataaattt gattatgtca tctgcttcat 1860 

ttacaaaact aaaatgtttt ctgagctaaa ctccagtagc taacttagta caaatggtat 1920 
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ttttaaatca ctgctataag tatatatatt tgaatagctc tggcaacgga cggaaatccc 1980 

tatggtcttt ccatgggaag atacaaacca atccataagt tgtccagcaa tatccaatat 2040 

ttccagccca gccagtcagg cctcttaaac attaccttac atatttgaac ctttccttaa 2100 

atgtcccctt tagacaatct attttttaaa aagatgaaaa tccatttaag catcatatat 2160 

cgaatgcgta gaagttgttt cattataatg gttctgcaga taggtaatgc caaaacggcc 2220 

aaaatatttg atcactagaa gcgtaaaagt caagtacaat catgttgact ttttttccaa 2280 

ggtgggttca ctgctgccca ccttggttcc aggccagtgc ttacttaaga tatcgtaagt 2340 

gatttttttt taatttttaa ttttttagta gttggttaat caaaagccag tcatgtcacc 2400 

ttcaggaaca tagaggctgg acgtgcttgg cagctcacga ctccaaagca cacttggctc 2460 

tgtggactga aaccctagga aacgtggatg tgagtctctt ggaacaactc aagttgttat 252 0 

ttgtttttct tttaggttgt tcggtgccaa ggcaggcggc aaatctgcct ccgcacctaa 2580 

tac 2583 



<210> 


22 




<211> 


37 




<212> 


DNA 




<213> 


Homo 


sapiens 


<220> 






<221> 


CDS 




<222> 


(2) . , 


. (34) 


<400> 


22 





c cgc ggg get tec ate ctt cct ttg act gat ttt taa 37 
Arg Gly Ala Ser lie Leu Pro Leu Thr Asp Phe * 



<210> 23 

<211> 6768 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (5928) 

<400> 23 



att tta att tgt att ttc ccc gee gec ccg ccc ctt ttc etc cga ccc 
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He Leu He Cys He Phe Pro Ala Ala 



cgc cct ate get ccc egg ctt ccc tgc 
Arg Pro He Ala Pro Arg Leu Pro Cys 



ctt cct cgc gtt tct ttc ccc tgc gec 
Leu Pro Arg Val Ser Phe Pro Cys Ala 



etc cct cgc tct etc ccc ctt etc tec 
Leu Pro Arg Ser Leu Pro Leu Leu Ser 



gtc etc tct etc ccc etc etc etc ccc 
Val Leu Ser Leu Pro Leu Leu Leu Pro 



cgc ccc ctg ccc cct ccc ccc gtg cct 
Arg Pro Leu Pro Pro Pro Pro Val Pro 



gcg etc etc gcg ggc aga atg ctg ggc 
Ala Leu Leu Ala Gly Arg Met Leu Gly 



ccc gag gtg gag ctg age age ggc ggc 
Pro Glu Val Glu Leu Ser Ser Gly Gly 



ccg egg ggc gee ggc agg aag gcg gca 
Pro Arg Gly Ala Gly Arg Lys Ala Ala 



ctg ccc aag cgc gec aag gcg ccc ggc 
Leu Pro Lys Arg Ala Lys Ala Pro Gly 



gec age gcg get gag ctg aag gtc ttc 
Ala Ser Ala Ala Glu Leu Lys Val Phe 



cgt gtc ccc ggc ggg ccg ccc gec tec 
Arg Val Pro Gly Gly Pro Pro Ala Ser 



etc ace aac etc tct ttt etc acg gac 
Leu Thr Asn Leu Ser Phe Leu Thr Asp 



tat gag ccc gaa tgg age gac gat atg 
Tyr Glu Pro Glu Trp Ser Asp Asp Met 



ggc aag gtg ggg tec aag ggc cgt gaa 
Gly Lys Val Gly Ser Lys Gly Arg Glu 



Pro Pro Leu Phe Leu Arg Pro 

tct ttc ctt ttt ccc ggc ttc 96 
Ser Phe Leu Phe Pro Gly Phe 

etc ggc ttg cct etc tec etc 144 
Leu Gly Leu Pro Leu Ser Leu 

cct tct tec teg gtt tct tec 192 
Pro Ser Ser Ser Val Ser Ser 

cgc etc etc etc ctg cgc tec 240 
Arg Leu Leu Leu Leu Arg Ser 

gca gac gcg egg ate gtc cat 2 88 

Ala Asp Ala Arg He Val His 

age age gtc aag age gtg cag 336 
Ser Ser Val Lys Ser Val Gin 

ggc gac gag ggc gcg gac gaa 3 84 

Gly Asp Glu Gly Ala Asp Glu 

gcg gcg gac ggc aga ggc atg 432 
Ala Ala Asp Gly Arg Gly Met 

ggc ggc ggc ggc atg gec aag 4 80 

Gly Gly Gly Gly Met Ala Lys 

aag tec ggc age gtg gac age 52 8 

Lys Ser Gly Ser Val Asp Ser 

aac ctg cgc aag cag aag tea 576 
Asn Leu Arg Lys Gin Lys Ser 

tec gag aaa aag ctg cag ctt 624 
Ser Glu Lys Lys Leu Gin Leu 

gee aag gcg ccc aaa ggc tta 672 
Ala Lys Ala Pro Lys Gly Leu 

get ccg ctg atg tec aag acg 72 0 

Ala Pro Leu Met Ser Lys Thr 
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ctg etc tec age aag gec aag gcg caa aag age tct ggg cct gtc ccc 
Leu Leu Ser Ser Lys Ala Lys Ala Gin Lys Ser Ser Gly Pro Val Pro 



tct gee aag ggc cag gag gag cgc gec ttc etc aag gtg gac ccc gag 
Ser Ala Lys Gly Gin Glu Glu Arg Ala Phe Leu Lys Val Asp Pro Glu 



cag gtg act cac age tec ctg gag atg ace tgc tac gac age gat gat 
Gin Val Thr His Ser Ser Leu Glu Met Thr Cys Tyr Asp Ser Asp Asp 



gec aac cca cgc age gtg tec age etc tec aac cgc teg tac cct ctg 
Ala Asn Pro Arg Ser Val Ser Ser Leu Ser Asn Arg Ser Tyr Pro Leu 



ctg tec aag teg gag cac teg etc ttc cag gee aag ggc age ccg gcg 768 
Leu Ser Lys Ser Glu His Ser Leu Phe Gin Ala Lys Gly Ser Pro Ala 

ggc ggc gec aag ace ccc ctg get ccg etc gcg ccc aac ctg gga aag 816 
Gly Gly Ala Lys Thr Pro Leu Ala Pro Leu Ala Pro Asn Leu Gly Lys 

ccg age egg ate cct cga gga ccc tat gcg gag gtc aag ccg etc age 864 
Pro Ser Arg lie Pro Arg Gly Pro Tyr Ala Glu Val Lys Pro Leu Ser 

aag gcg cct gaa gcg gec gtg age gaa gat ggc aaa teg gac gac gag 912 
Lys Ala Pro Glu Ala Ala Val Ser Glu Asp Gly Lys Ser Asp Asp Glu 



960 



1008 



ctg gtg gtg ace gtg ctg gga gac ctg gag cag ctg cte ttc age cag 1056 
Leu Val Val Thr Val Leu Gly Asp Leu Glu Gin Leu Leu Phe Ser Gin 

atg ctg gac cca gag tec cag aga aag agg aca gtg cag aat gtc ctg 1104 
Met Leu Asp Pro Glu Ser Gin Arg Lys Arg Thr Val Gin Asn Val Leu 

gat etc egg cag aac ctg gaa gag acc atg tec age ctg cga ggg tec 1152 
Asp Leu Arg Gin Asn Leu Glu Glu Thr Met Ser Ser Leu Arg Gly Ser 



1200 



1248 



tea tgg cgc tat ggc cag tec agt ccg egg ctg cag get ggt gac gcg 12 96 
Ser Trp Arg Tyr Gly Gin Ser Ser Pro Arg Leu Gin Ala Gly Asp Ala 

ccc tct gtg ggt ggg age tgc cgc teg gag ggg acg ccc gee tgg tac 1344 
Pro Ser Val Gly Gly Ser Cys Arg Ser Glu Gly Thr Pro Ala Trp Tyr 

atg cac ggc gaa egg gec cac tac tec cac acc atg ccc atg cgc age 1392 
Met His Gly Glu Arg Ala His Tyr Ser His Thr Met Pro Met Arg Ser 
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agt act ccc act get tct cgc agg aac tea aca ata gtg eta cgc aca 
Ser Thr Pro Thr Ala Ser Arg Arg Asn Ser Thr lie Val Leu Arg Thr 



age tgt gat gat tea tec aag ggt gga gaa ctg aaa aag ccc ate age 
Ser Cys Asp Asp Ser Ser Lys Gly Gly Glu Leu Lys Lys Pro He Ser 

ctg ggc cac cct ggt tec ctg aag aag ggc aag ace cca cct gtg get 
Leu Gly His Pro Gly Ser Leu Lys Lys Gly Lys Thr Pro Pro Val Ala 



gta act tec ccc ate act cac aca gec cag agt gee etc aaa gtc gca 
Val Thr Ser Pro He Thr His Thr Ala Gin Ser Ala Leu Lys Val Ala 



aat act ggg etc caa cgc tec tec tct gat get ggt egg gac cgc ctg 
Asn Thr Gly Leu Gin Arg Ser Ser Ser Asp Ala Gly Arg Asp Arg Leu 



1488 



ccc age aag etc age cat ate tec cgc ctg gag ctg gtc gaa tec ctg 1440 

Pro Ser Lys Leu Ser His He Ser Arg Leu Glu Leu Val Glu Ser Leu 

gac teg gat gag gtg gac etc aag tec ggc tac atg age gac agt gac 

Asp Ser Asp Glu Val Asp Leu Lys Ser Gly Tyr Met Ser Asp Ser Asp 

etc atg ggc aag ace atg acg gag gat gat gac ate act ace ggc tgg 1536 

Leu Met Gly Lys Thr Met Thr Glu Asp Asp Asp He Thr Thr Gly Trp 

gat gaa age age tec ate agt agt gga etc age gat gee tea gac aat 1584 

Asp Glu Ser Ser Ser He Ser Ser Gly Leu Ser Asp Ala Ser Asp Asn 

etc agt tea gaa gaa ttc aat gec age tec tea etc aac tec etc cca 1632 

Leu Ser Ser Glu Glu Phe Asn Ala Ser Ser Ser Leu Asn Ser Leu Pro 



1680 



gac tea gag aag cgc tea ctg gca gaa agt ggg ctg age tgg ttt agt 172 8 

Asp Ser Glu Lys Arg Ser Leu Ala Glu Ser Gly Leu Ser Trp Phe Ser 

gaa tea gag gag aaa gec cct aaa aaa ctg gag tac gac agt ggt age 1776 

Glu Ser Glu Glu Lys Ala Pro Lys Lys Leu Glu Tyr Asp Ser Gly Ser 

ctg aag atg gaa cct ggg act tct aag tgg egg agg gag egg cct gag 1824 

Leu Lys Met Glu Pro Gly Thr Ser Lys Trp Arg Arg Glu Arg Pro Glu 



1872 



1920 



1968 



ggc aaa cct gag ggc aaa get aca gac aag ggt aag ctt gca gtg aag 2016 
Gly Lys Pro Glu Gly Lys Ala Thr Asp Lys Gly Lys Leu Ala Val Lys 



2064 
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agt gat get aag aag ccc ccc teg ggc att get cgc ccc tec act teg 2112 
Ser Asp Ala Lys Lys Pro Pro Ser Gly lie Ala Arg Pro Ser Thr Ser 



gga tec ttt ggc tac aag aag cct cct cct gee aca ggc aca gec act 
Gly Ser Phe Gly Tyr Lys Lys Pro Pro Pro Ala Thr Gly Thr Ala Thr 



gtc atg caa act ggt ggt tea gec act etc age aag ate cag aag tec 
Val Met Gin Thr Gly Gly Ser Ala Thr Leu Ser Lys lie Gin Lys Ser 



gtt tec aac agt gca gag cca gga ttc ctg get cct gga gee cgt tct 
Val Ser Asn Ser Ala Glu Pro Gly Phe Leu Ala Pro Gly Ala Arg Ser 



age gtg ace ggc ggg egg ggt gga cct cgc cct gtg age age age att 
Ser Val Thr Gly Gly Arg Gly Gly Pro Arg Pro Val Ser Ser Ser lie 



gac ccc agt etc etc age acc aag cag gga ggc ctt acg cct tec aga 
Asp Pro Ser Leu Leu Ser Thr Lys Gin Gly Gly Leu Thr Pro Ser Arg 

ctg aag gag cct acc aag gta gec agt ggg egg acc act cca gec cct 
Leu Lys Glu Pro Thr Lys Val Ala Ser Gly Arg Thr Thr Pro Ala Pro 



gee 



Ala Leu Asp Ser Asp Asn lie Ser Leu Lys Ser lie Gly Ser Pro Glu 



agt act ccc aag aac caa gca age cac ccc aca gee acc aag ctg gca 
Ser Thr Pro Lys Asn Gin Ala Ser His Pro Thr Ala Thr Lys Leu Ala 



gag ctg cca cca acc cct etc agg gee aca gcg aag age ttt gtc aaa 
Glu Leu Pro Pro Thr Pro Leu Arg Ala Thr Ala Lys Ser Phe Val Lys 



eta cca tea tec agt gat acc acc cat get tea aag gtc cca gat ctg 
Leu Pro Ser Ser Ser Asp Thr Thr His Ala Ser Lys Val Pro Asp Leu 



2160 



2208 



tea ggc ate cct gtc aag cca gta aat ggg cgc aag act age tta gat 2256 
Ser Gly lie Pro Val Lys Pro Val Asn Gly Arg Lys Thr Ser Leu Asp 



2304 



aac ate cag tac cgc age ctg ccc egg cca gee aag tea agt tct atg 2352 
Asn lie Gin Tyr Arg Ser Leu Pro Arg Pro Ala Lys Ser Ser Ser Met 



2400 



2448 



2496 



gtc aat cag aca gat egg gaa aag gag aag gec aaa gee aag gca gtg 2544 
Val Asn Gin Thr Asp Arg Glu Lys Glu Lys Ala Lys Ala Lys Ala Val 



ttg gac tea gac aac ate tec ttg aag agt att ggc tec cca gaa 2592 



2640 



2688 



cca ccc tea eta gec aat ctt gac aag gtc aac tec aac agt ctg gat 2736 
Pro Pro Ser Leu Ala Asn Leu Asp Lys Val Asn Ser Asn Ser Leu Asp 



2784 
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cat get aca age tea gca tct ggg ggc cct 
His Ala Thr Ser Ser Ala Ser Gly Gly Pro 

ccc agt ccg gca ccc ate etc aat att aac 
Pro Ser Pro Ala Pro lie Leu Asn lie Asn 

ggc ctg gag eta atg agt ggt ttc agt gtg 
Gly Leu Glu Leu Met Ser Gly Phe Ser Val 

tac ccc aaa etc tea ggc ctg cac agg age 
Tyr Pro Lys Leu Ser Gly Leu His Arg Ser 

cca atg age etc ccc agt gee ttc ccc age 
Pro Met Ser Leu Pro Ser Ala Phe Pro Ser 

cca cct get ccc cct get get ecc aca gaa 
Pro Pro Ala Pro Pro Ala Ala Pro Thr Glu 

act tgg agt gga age ccc aga get ggg caa 
Thr Trp Ser Gly Ser Pro Arg Ala Gly Gin 

gat egg aac act ctt ccc aag aaa ggg etc 
Asp Arg Asn Thr Leu Pro Lys Lys Gly Leu 

cag gag gag acc aag gag agg cga cat tec 
Gin Glu Glu Thr Lys Glu Arg Arg His Ser 

cct gaa tec gat gac cag tea gag ctg cct 
Pro Glu Ser Asp Asp Gin Ser Glu Leu Pro 

atg tct ctg agt gca aag ggc caa ctt acc 
Met Ser Leu Ser Ala Lys Gly Gin Leu Thr 

gcg gec acc acg cca aga ate acc cgc tec 
Ala Ala Thr Thr Pro Arg lie Thr Arg Ser 

gag gcg gee ttc gag ctg tac age ggc tec 
Glu Ala Ala Phe Glu Leu Tyr Ser Gly Ser 

tec ctg gee gag aga ccc aag gga atg att 
Ser Leu Ala Glu Arg Pro Lys Gly Met lie 

gac ccc acg gac gat gtt cac ggc tea gtg 
Asp Pro Thr Asp Asp Val His Gly Ser Val 

gee tec tec acc tac tec tea get gag gag 
Ala Ser Ser Thr Tyr Ser Ser Ala Glu Glu 

ate egg aag ctt cgt agg gaa ctg gaa tea 
lie Arg Lys Leu Arg Arg Glu Leu Glu Ser 

acc ttg acg tct cag ctt tct gee aat get 
Thr Leu Thr Ser Gin Leu Ser Ala Asn Ala 

gag cag age ctg gtg aat atg aca tec cgc 
Glu Gin Ser Leu Val Asn Met Thr Ser Arg 
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etc cct tec tgc ttc acc 2832 
Leu Pro Ser Cys Phe Thr 

tea gec age ttc tec cag 2880 
Ser Ala Ser Phe Ser Gin 

cca aaa gag acc cgc atg 2928 
Pro Lys Glu Thr Arg Met 

atg gag tec etc cag atg 2976 
Met Glu Ser Leu Gin Met 

agt act ccc gtc ccc acc 3024 
Ser Thr Pro Val Pro Thr 

gaa gag acg gaa gag ctg 3 072 

Glu Glu Thr Glu Glu Leu 

ctg gac agt aat cag egg 312 0 

Leu Asp Ser Asn Gin Arg 

agg tac cag ctt cag tec 3168 
Arg Tyr Gin Leu Gin Ser 

cat acc att ggt ggg ctg 3216 
His Thr lie Gly Gly Leu 

tct ccc cct gca ctt ccc 3264 
Ser Pro Pro Ala Leu Pro 

aac ata gtg agt ccc act 3312 
Asn lie Val Ser Pro Thr 

aac age ate ccc acc cac 3360 
Asn Ser lie Pro Thr His 

caa atg ggg age acc ctg 3408 
Gin Met Gly Ser Thr Leu 

egg tea gga tec ttc cga 3456 
Arg Ser Gly Ser Phe Arg 

ctg tec ctg gee tec agt 3504 
Leu Ser Leu Ala Ser Ser 

agg atg caa tct gag caa 3552 
Arg Met Gin Ser Glu Gin 

tec cag gaa aaa gtg gee 3600 
Ser Gin Glu Lys Val Ala 

aat ctg gtg get get ttt 3648 
Asn Leu Val Ala Ala Phe 

ctg cga cac ctg gca gag 3696 
Leu Arg His Leu Ala Glu 
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acg gcc gag gag aag gac act gag ctg ctg 
Thr Ala Glu Glu Lys Asp Thr Glu Leu Leu 



gac ttt ctg aag aaa aag aac tct gag gcc 
Asp Phe Leu Lys Lys Lys Asn Ser Glu Ala 

gcc ctt aat gcc tea gaa acc aca ccc aaa 
Ala Leu Asn Ala Ser Glu Thr Thr Pro Lys 

caa aac tec tea gat age ate tea age etc 
Gin Asn Ser Ser Asp Ser lie Ser Ser Leu 

tec age ate ggc age age aag gat get gat 
Ser Ser lie Gly Ser Ser Lys Asp Ala Asp 

aag agt tgg ctt cga agt tec ttc aac aaa 
Lys Ser Trp Leu Arg Ser Ser Phe Asn Lys 

ggg ccc aag tea get tec tea tac teg gat 
Gly Pro Lys Ser Ala Ser Ser Tyr Ser Asp 

ccc gac tct tea gcc ccc tea tec ccc aaa 
Pro Asp Ser Ser Ala Pro Ser Ser Pro Lys 

gag act get tea ccc tec ate aag tec tec 
Glu Thr Ala Ser Pro Ser lie Lys Ser Ser 

act gat gtc acc gag ggc cct get cac cca 
Thr Asp Val Thr Glu Gly Pro Ala His Pro 

ttc cat gca aat gag gag gag gag cca gag 
Phe His Ala Asn Glu Glu Glu Glu Pro Glu 

ctg cgc tct gag eta tgg gag aag gaa atg 
Leu Arg Ser Glu Leu Trp Glu Lys Glu Met 

ttg gag gcc etc aac tct gcc cac caa ctg 
Leu Glu Ala Leu Asn Ser Ala His Gin Leu 

atg cac aac atg cag ttg gag gtg gac ctg 
Met His Asn Met Gin Leu Glu Val Asp Leu 

cga ctg aag gta gcc cca ggc ccc tea tea 
Arg Leu Lys Val Ala Pro Gly Pro Ser Ser 

gtc cct gga tea tct gca tta tct tec cca 
Val Pro Gly Ser Ser Ala Leu Ser Ser Pro 

gca etc acc cat tec ttc ggc ccc agt ctt 
Ala Leu Thr His Ser Phe Gly Pro Ser Leu 

ccc atg gat ggc ate agt act tgt ggt cca 
Pro Met Asp Gly lie Ser Thr Cys Gly Pro 

egg gtg gtg gtg agg atg ccc ccg cag cac 
Arg Val Val Val Arg Met Pro Pro Gin His 



gat ttg cga gaa acc ata 3744 
Asp Leu Arg Glu Thr lie 



cag gca gtc att cag gga 3792 
Gin Ala Val lie Gin Gly 

gaa ctt egg ate aag aga 3 840 

Glu Leu Arg lie Lys Arg 

aac age ate act age cat 3888 
Asn Ser lie Thr Ser His 

gcg aaa aag aag aaa aaa 3 93 6 

Ala Lys Lys Lys Lys Lys 

gcg ttc agt ata aaa aag 3 984 

Ala Phe Ser lie Lys Lys 

ata gag gag att get aca 4032 
lie Glu Glu lie Ala Thr 

eta cag cat ggt tct aca 4080 
Leu Gin His Gly Ser Thr 

acc teg tec tec gtg ggc 4128 
Thr Ser Ser Ser Val Gly 

gcc ccc cac act agg ctg 4176 
Ala Pro His Thr Arg Leu 

aag aag gag gta teg gag 4224 
Lys Lys Glu Val Ser Glu 

aag ctt aca gac ate cgc 4272 
Lys Leu Thr Asp lie Arg 

gat cag ctt egg gag acc 4320 
Asp Gin Leu Arg Glu Thr 

ctg gaa gca gag aat gac 4368 
Leu Glu Ala Glu Asn Asp 

ggc tec act cca ggg cag 4416 
Gly Ser Thr Pro Gly Gin 

cgc cgc tec eta ggc ctg 4464 
Arg Arg Ser Leu Gly Leu 

gca gac aca gac ctg tea 4512 
Ala Asp Thr Asp Leu Ser 

aag gag gaa gtg acc etc 4560 
Lys Glu Glu Val Thr Leu 

ate ate aaa ggg gac ttg 4608 
lie lie Lys Gly Asp Leu 
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aag cag cag gaa 
Lys Gin Gin Glu 

gac tgg aag atg 
Asp Trp Lys Met 

att tct aaa atg 
lie Ser Lys Met 

ate cat ggc tac 
lie His Gly Tyr 



ttc ttc ctg ggc 
Phe Phe Leu Gly 

ctg gat gaa get 
Leu Asp Glu Ala 

gac cca gee tct 
Asp Pro Ala Ser 

age ate age cac 
Ser lie Ser His 



tgt age aag gtc 
Cys Ser Lys Val 

gtt ttc caa gtg 
Val Phe Gin Val 

ace ctg gga eta 
Thr Leu Gly Leu 

gtg aaa cga gtg 
Val Lys Arg Val 



a 9t 99 a aaa 9 tt: 
Ser Gly Lys Val 

ttc aag gac tat 
Phe Lys Asp Tyr 

age act gag tec 
Ser Thr Glu Ser 

ttg gat gca gag 
Leu Asp Ala Glu 



ccc ccc gag atg 
Pro Pro Glu Met 

tec etc aaa ggt 
Ser Leu Lys Gly 

acg ctg ate ccc 
Thr Leu lie Pro 

aag cac egg cgc 
Lys His Arg Arg 

tac ctg acc aat 
Tyr Leu Thr Asn 

gag gtc aca gag 
Glu Val Thr Glu 

tgc aag gat ctg 
Cys Lys Asp Leu 

egg gaa aca gga 
Arg Glu Thr Gly 

ctg agt gaa gca 
Leu Ser Glu Ala 

tgc aag tat cat 
Cys Lys Tyr His 

gta aaa atg aca 
Val Lys Met Thr 

acc ttc tec aac 
Thr Phe Ser Asn 

ctg agg agg aag 
Leu Arg Arg Lys 

gaa gag ctg ctt 
Glu Glu Leu Leu 

etc cac acc ttc 



cct cct tgc cgt 
Pro Pro Cys Arg 

ctg aag gag aaa 
Leu Lys Glu Lys 

aag ccg atg atg 
Lys Pro Met Met 

etc gtc etc teg 
Leu Val Leu Ser 

cgc ttg gec gag 
Arg Leu Ala Glu 

ggc ate gtc age 
Gly lie Val Ser 

caa ctg tat ctt 
Gin Leu Tyr Leu 

att ggg gat gtg 
lie Gly Asp Val 

ggc tec ate agt 
Gly Ser lie Ser 

aaa tgt ccc tat 
Lys Cys Pro Tyr 

ccc aac cat ggc 
Pro Asn His Gly 

aac gtg gag cca 
Asn Val Glu Pro 

ctg gta gag tea 
Leu Val Glu Ser 

egg gtg etc gac 
Arg Val Leu Asp 

ctt gag aag cac 



cga ggt gtc aat 
Arg Gly Val Asn 

tgc gtc gac age 
Cys Val Asp Ser 

cag cac tac ata 
Gin His Tyr lie 

ggc ccc age ggc 
Gly Pro Ser Gly 

tac ctg gtg gag 
Tyr Leu Val Glu 

acc ttc aac atg 
Thr Phe Asn Met 

tec aac eta gee 
Ser Asn Leu Ala 

ccc ctg gtg att 
Pro Leu Val lie 

gag ttg gtc aat 
Glu Leu Val Asn 

att ata ggt acc 
lie lie Gly Thr 

ttt cac ttg age 
Phe His Leu Ser 

gee aat ggc ttc 
Ala Asn Gly Phe 

gac age gac ate 
Asp Ser Asp lie 

tgg gta ccc aag 
Trp Val Pro Lys 

age acc tea gac 



aac ata tea gtc 
Asn lie Ser Val 

ctg gtg ttc gag 
Leu Val Phe Glu 

age etc ctg ctg 
Ser Leu Leu Leu 

acg ggc aag acc 
Thr Gly Lys Thr 

cgc tct ggc cgt 
Arg Ser Gly Arg 

cac cag cag tct 
His Gin Gin Ser 

aac cag ata gac 
Asn Gin lie Asp 

eta ttg gat gac 
Leu Leu Asp Asp 

ggg gec etc acc 
Gly Ala Leu Thr 

acc aat cag cct 
Thr Asn Gin Pro 

ttc agg atg ttg 
Phe Arg Met Leu 

ctg gtt cgt tac 
Leu Val Arg Tyr 

aat gee aac aag 
Asn Ala Asn Lys 

ctg tgg tat cat 
Leu Trp Tyr His 

ttc etc ate ggc 



4656 
4704 
4752 
4800 

4848 

4896 

4944 

4992 

5040 

5088 

5136 

5184 

5232 

5280 

5328 

5376 

5424 

5472 

5520 
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Leu 


Hi s 


Thr 


Phe 


Leu 


Glu 


Lys 


His 


Ser 


Thr 


Ser 


Asp 


Phe 


Leu 


lie 


Gly 




cct 


tgc 


ttc 


ttt 


ctg 


teg 


tgt 


ccc 


att 


ggc 


att 


gag 


gac 


ttc 


egg 


acc 


5 568 


Pro 


Cy s 


Phe 


Phe 


Leu 


Ser 


Cvs 


Pro 


He 


Gly 


He 


Glu 


Asp 


Phe 


Arg 


Thr 




tgg 


ttc 


att 


gac 


ctg 


tgg 


aac 


aac 


tct 


ate 


att 


ccc 


tat 


eta 


cag 


gaa 


5 616 


irp 


Phe 


lie 


Asp 


Leu 


Trn 


Asn 


Asn 


Ser 


He 


He 


Pro 


Tyr 


Leu 


Gin 


Glu 




gga 


gcc 


aag 


gat 


ggg 


ata 


aag 


gtc 


cat 


gga 


cag 


aaa 


get 


get 


tgg 


gag 


5664 


Pi w 

\j± y 




Lys 


Asp 


Gly 


He 


Lys 


Val 


His 


Gly 


Gin 


Lys 


Ala 


Ala 


Trp 


Glu 




gac 


cca 


gtg 


gaa 


tgg 


gtc 


egg 


gac 


aca 


Ctt 


ccc 


tgg 


cca 


tea 


gcc 


caa 


5712 


Asp 


Piro 


Val 


Glu 


Trr> 


Val 


Arg 


Asp 


Thr 


Leu 


Pro 


Trp 


Pro 


Ser 


Ala 


Gin 




c a a. 


gac 


caa 


tea 


aag 


ctg 


tac 


cac 


ctg 


ccc 


cca 


ccc 


ace 


gtg 


ggc 


cct 


5760 


Gin 


Asp 


Gin 


Ser 


Lys 


Leu 


Tyr 


His 


Leu 


Pro 


Pro 


Pro 


Thr 


Val 


Gly 


Pro 




cac 


age 


att 


gcc 


tea 


cct 


ccc 


gag 


gat 


agg 


aca 


gtc 


aaa 


gac 


age 


acc 


5808 


His 


Ser 


He 


Ala 


Ser 


Pro 


Pro 


Glu 


Asp 


Arg 


Thr 


Val 


Lys 


Asp 


Ser 


Thr 




cca 


agt 


tct 


ctg 


gac 


tea 


gat 


cct 


ctg 


atg 


gcc 


atg 


ctg 


ctg 


aaa 


ctt 


5856 


Pro 


Ser 


Ser 


Leu 


Asp 


Ser 


Asp 


Pro 


Leu 


Met 


Ala 


Met 


Leu 


Leu 


Lys 


Leu 




caa 


gaa 


get 


gcc 


aac 


tac 


att 


gag 


tct 


cca 


gat 


cga 


gaa 


ace 


ate 


ctg 


5904 


Gin 


Glu 


Ala 


Ala 


Asn 


Tyr 


He 


Glu 


Ser 


Pro 


Asp 


Arg 


Glu 


Thr 


He 


Leu 




gac 


ccc 


aac 


ctt 


cag 


gca 


aca 


ctt 


taagggttcg < 


gcaatcactg 




5948 




Asp 


Pro 


Asn 


Leu 


Gin 


Ala 


Thr 


Leu 





















tcacccccgg acagcagaac gctggcatca gctatcttag ctcctcctct cccctctcct 6008 

ctttcagagc actggctctc cagccccagg aggagaacag gagggaggag gagatgaaag 6068 

aggagggaca ggttcttggt gctgtacctt tgagaacttc ctaggaagga atggtggggt 6128 

ggcgtttggg aacttgtgcc ccctaaacac atttactggc ctcctctaat gactttgggg 6188 

aaaagatgat tctgggtctt tcccttgact tcttgtttca attacaaact cctgggcttt 6248 

ctggggaggg gttcagaaaa catcaaaaca ctgcagcagt tcctaaatga ttctcacaag 63 08 

caaccctgag agagacagtc ttgtgaggga gatctggggg aggcaggaag ctcctcagat 6368 

tttctcacag acccttccca attccatcac cactgccaac aactcctccc ccagagatct 6428 

ggctggagcc cagaaaaaga agcatgtggt ttaaaaaatg tttaaatcaa tctgtaaaag 6488 

gtaaaaatga aaaacaaaaa caagcaaaca aacaaaaaac aatggaaaag atgaagctgg 6548 

agagagagga accagttgcc aaggtagaga gctgcccgct cctgccctct ggatgacata 6608 

ggggacatca acaagaegge tgccaacctg agaagtcacc aaaccacaaa aataacctta 6668 

cagccttcag ggaaagacta ccagctctgt ctttctaccc tctaatttaa caatgeataa 6728 
gagtcaataa accctacttt tttaaaaaaa aaaaaaaaag 6768 
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<210> 24 
<211> 11 
<212> PRT 

<213> Homo sapiens 



<400> 24 

Arg Gly Ala Ser lie Leu Pro Leu Thr Asp Phe 
15 10 



<210> 25 
<211> 1976 
<212> PRT 

<213> Homo sapiens 



<400> 25 



lie Leu lie Cys 



Arg Pro lie Ala 
20 

Leu Pro Arg Val 
35 

Leu Pro Arg Ser 
50 

Val Leu Ser Leu 
65 

Arg Pro Leu Pro 



Ala Leu Leu Ala 
100 

Pro Glu Val Glu 
115 

Pro Arg Gly Ala 
130 

Leu Pro Lys Arg 
145 

Ala Ser Ala Ala 



lie Phe Pro Ala 

5 

Pro Arg Leu Pro 



Ser Phe Pro Cys 
40 

Leu Pro Leu Leu 

55 

Pro Leu Leu Leu 
70 

Pro Pro Pro Val 
85 

Gly Arg Met Leu 



Leu Ser Ser Gly 
120 

Gly Arg Lys Ala 
135 

Ala Lys Ala Pro 
150 

Glu Leu Lys Val 



Ala Pro Pro Leu 
10 

Cys Ser Phe Leu 
25 

Ala Leu Gly Leu 



Ser Pro Ser Ser 
60 

Pro Arg Leu Leu 
75 

Pro Ala Asp Ala 
90 

Gly Ser Ser Val 
105 

Gly Gly Asp Glu 



Ala Ala Ala Asp 
140 

Gly Gly Gly Gly 
155 

Phe Lys Ser Gly 



Phe Leu Arg Pro 
15 

Phe Pro Gly Phe 
30 

Pro Leu Ser Leu 
45 

Ser Val Ser Ser 



Leu Leu Arg Ser 
80 

Arg lie Val His 
95 

Lys Ser Val Gin 
110 

Gly Ala Asp Glu 
125 

Gly Arg Gly Met 



Gly Met Ala Lys 
160 

Ser Val Asp Ser 
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165 

Arg Val Pro Gly Gly 
180 

Leu Thr Asn Leu Ser 
195 

Tyr Glu Pro Glu Trp 
210 

Gly Lys Val Gly Ser 
225 

Leu Ser Lys Ser Glu 
245 

Gly Gly Ala Lys Thr 
260 

Pro Ser Arg lie Pro 
275 

Lys Ala Pro Glu Ala 
290 

Leu Leu Ser Ser Lys 
305 

Ser Ala Lys Gly Gin 
325 

Leu Val Val Thr Val 
340 

Met Leu Asp Pro Glu 
355 

Asp Leu Arg Gin Asn 
370 

Gin Val Thr His Ser 
385 



170 

Pro Pro Ala Ser Asn Leu 
185 

Phe Leu Thr Asp Ser Glu 
200 

Ser Asp Asp Met Ala Lys 
215 

Lys Gly Arg Glu Ala Pro 
230 235 

His Ser Leu Phe Gin Ala 
250 

Pro Leu Ala Pro Leu Ala 
265 

Arg Gly Pro Tyr Ala Glu 
280 

Ala Val Ser Glu Asp Gly 
295 

Ala Lys Ala Gin Lys Ser 
310 315 

Glu Glu Arg Ala Phe Leu 
330 

Leu Gly Asp Leu Glu Gin 
345 

Ser Gin Arg Lys Arg Thr 
360 

Leu Glu Glu Thr Met Ser 
375 

Ser Leu Glu Met Thr Cys 
390 395 



175 

Arg Lys Gin Lys Ser 
190 

Lys Lys Leu Gin Leu 
205 

Ala Pro Lys Gly Leu 
220 

Leu Met Ser Lys Thr 
240 

Lys Gly Ser Pro Ala 
255 

Pro Asn Leu Gly Lys 
270 

Val Lys Pro Leu Ser 
285 

Lys Ser Asp Asp Glu 
300 

Ser Gly Pro Val Pro 
320 

Lys Val Asp Pro Glu 

335 

Leu Leu Phe Ser Gin 
350 

Val Gin Asn Val Leu 
365 

Ser Leu Arg Gly Ser 
380 

Tyr Asp Ser Asp Asp 
400 



Ala Asn Pro Arg 



Ser Trp Arg Tyr 
420 

Pro Ser Val Gly 
435 

Met His Gly Glu 
450 



Ser Val Ser Ser 
405 

Gly Gin Ser Ser 



Gly Ser Cys Arg 
440 

Arg Ala His Tyr 
455 



Leu Ser Asn Arg 
410 

Pro Arg Leu Gin 
425 

Ser Glu Gly Thr 



Ser His Thr Met 
460 



Ser Tyr Pro Leu 
415 

Ala Gly Asp Ala 
430 

Pro Ala Trp Tyr 
445 

Pro Met Arg Ser 



63 



Pro Ser Lys Leu Ser His lie Ser Arg Leu Glu Leu Val Glu Ser Leu 
465 470 475 480 

Asp Ser Asp Glu Val Asp Leu Lys Ser Gly Tyr Met Ser Asp Ser Asp 
485 490 495 

Leu Met Gly Lys Thr Met Thr Glu Asp Asp Asp lie Thr Thr Gly Trp 
500 505 510 

Asp Glu Ser Ser Ser lie Ser Ser Gly Leu Ser Asp Ala Ser Asp Asn 
515 520 525 

Leu Ser Ser Glu Glu Phe Asn Ala Ser Ser Ser Leu Asn Ser Leu Pro 
530 535 540 

Ser Thr Pro Thr Ala Ser Arg Arg Asn Ser Thr lie Val Leu Arg Thr 
545 550 555 560 

Asp Ser Glu Lys Arg Ser Leu Ala Glu Ser Gly Leu Ser Trp Phe Ser 
565 570 575 

Glu Ser Glu Glu Lys Ala Pro Lys Lys Leu Glu Tyr Asp Ser Gly Ser 
580 585 590 

Leu Lys Met Glu Pro Gly Thr Ser Lys Trp Arg Arg Glu Arg Pro Glu 
595 600 605 

Ser Cys Asp Asp Ser Ser Lys Gly Gly Glu Leu Lys Lys Pro lie Ser 
610 615 620 

Leu Gly His Pro Gly Ser Leu Lys Lys Gly Lys Thr Pro Pro Val Ala 
625 630 635 640 

Val Thr Ser Pro lie Thr His Thr Ala Gin Ser Ala Leu Lys Val Ala 
645 650 655 

Gly Lys Pro Glu Gly Lys Ala Thr Asp Lys Gly Lys Leu Ala Val Lys 
660 665 670 

Asn Thr Gly Leu Gin Arg Ser Ser Ser Asp Ala Gly Arg Asp Arg Leu 
675 680 685 

Ser Asp Ala Lys Lys Pro Pro Ser Gly He Ala Arg Pro Ser Thr Ser 
690 695 700 

Gly Ser Phe Gly Tyr Lys Lys Pro Pro Pro Ala Thr Gly Thr Ala Thr 
705 710 715 720 

Val Met Gin Thr Gly Gly Ser Ala Thr Leu Ser Lys He Gin Lys Ser 
725 730 735 

Ser Gly He Pro Val Lys Pro Val Asn Gly Arg Lys Thr Ser Leu Asp 
740 745 750 



Val Ser Asn Ser Ala Glu Pro Gly Phe Leu Ala Pro Gly Ala Arg Ser 
755 760 765 



64 



Asn He Gin Tyr Arg Ser Leu Pro Arg Pro Ala Lys Ser Ser Ser Met 
770 775 780 



Ser Val Thr Gly Gly Arg Gly Gly Pro Arg Pro Val Ser Ser Ser He 
785 790 795 800 

Asp Pro Ser Leu Leu Ser Thr Lys Gin Gly Gly Leu Thr Pro Ser Arg 
805 810 815 

Leu Lys Glu Pro Thr Lys Val Ala Ser Gly Arg Thr Thr Pro Ala Pro 
820 825 830 

Val Asn Gin Thr Asp Arg Glu Lys Glu Lys Ala Lys Ala Lys Ala Val 
835 840 845 

Ala Leu Asp Ser Asp Asn He Ser Leu Lys Ser He Gly Ser Pro Glu 
850 855 860 

Ser Thr Pro Lys Asn Gin Ala Ser His Pro Thr Ala Thr Lys Leu Ala 
865 870 875 880 

Glu Leu Pro Pro Thr Pro Leu Arg Ala Thr Ala Lys Ser Phe Val Lys 
885 890 895 

Pro Pro Ser Leu Ala Asn Leu Asp Lys Val Asn Ser Asn Ser Leu Asp 
900 905 910 

Leu Pro Ser Ser Ser Asp Thr Thr His Ala Ser Lys Val Pro Asp Leu 
915 920 925 

His Ala Thr Ser Ser Ala Ser Gly Gly Pro Leu Pro Ser Cys Phe Thr 
930 935 940 

Pro Ser Pro Ala Pro He Leu Asn He Asn Ser Ala Ser Phe Ser Gin 
945 950 955 960 

Gly Leu Glu Leu Met Ser Gly Phe Ser Val Pro Lys Glu Thr Arg Met 
965 970 975 

Tyr Pro Lys Leu Ser Gly Leu His Arg Ser Met Glu Ser Leu Gin Met 
980 985 990 

Pro Met Ser Leu Pro Ser Ala Phe Pro Ser Ser Thr Pro Val Pro Thr 
995 1000 1005 

Pro Pro Ala Pro Pro Ala Ala Pro Thr Glu Glu Glu Thr Glu Glu Leu 
1010 1015 1020 

Thr Trp Ser Gly Ser Pro Arg Ala Gly Gin Leu Asp Ser Asn Gin Arg 
1025 1030 1035 1040 

Asp Arg Asn Thr Leu Pro Lys Lys Gly Leu Arg Tyr Gin Leu Gin Ser 
1045 1050 1055 

Gin Glu Glu Thr Lys Glu Arg Arg His Ser His Thr He Gly Gly Leu 
1060 1065 1070 



65 



Pro Glu Ser Asp Asp Gin Ser Glu Leu Pro Ser Pro Pro Ala Leu Pro 
1075 1080 1085 

Met Ser Leu Ser Ala Lys Gly Gin Leu Thr Asn lie Val Ser Pro Thr 
1090 1095 1100 



Ala Ala Thr Thr Pro Arg lie Thr Arg Ser Asn Ser lie Pro Thr His 
1105 1110 1115 1120 

Glu Ala Ala Phe Glu Leu Tyr Ser Gly Ser Gin Met Gly Ser Thr Leu 
1125 1130 1135 

Ser Leu Ala Glu Arg Pro Lys Gly Met lie Arg Ser Gly Ser Phe Arg 
1140 1145 1150 

Asp Pro Thr Asp Asp Val His Gly Ser Val Leu Ser Leu Ala Ser Ser 
1155 1160 1165 

Ala Ser Ser Thr Tyr Ser Ser Ala Glu Glu Arg Met Gin Ser Glu Gin 
1170 1175 1180 

lie Arg Lys Leu Arg Arg Glu Leu Glu Ser Ser Gin Glu Lys Val Ala 
1185 1190 1195 1200 

Thr Leu Thr Ser Gin Leu Ser Ala Asn Ala Asn Leu Val Ala Ala Phe 
1205 1210 1215 

Glu Gin Ser Leu Val Asn Met Thr Ser Arg Leu Arg His Leu Ala Glu 
1220 1225 1230 

Thr Ala Glu Glu Lys Asp Thr Glu Leu Leu Asp Leu Arg Glu Thr lie 
1235 1240 1245 

Asp Phe Leu Lys Lys Lys Asn Ser Glu Ala Gin Ala Val lie Gin Gly 
1250 1255 1260 

Ala Leu Asn Ala Ser Glu Thr Thr Pro Lys Glu Leu Arg He Lys Arg 
1265 1270 1275 1280 

Gin Asn Ser Ser Asp Ser lie Ser Ser Leu Asn Ser He Thr Ser His 
1285 1290 1295 

Ser Ser He Gly Ser Ser Lys Asp Ala Asp Ala Lys Lys Lys Lys Lys 
1300 1305 1310 

Lys Ser Trp Leu Arg Ser Ser Phe Asn Lys Ala Phe Ser He Lys Lys 
1315 1320 1325 

Gly Pro Lys Ser Ala Ser Ser Tyr Ser Asp He Glu Glu He Ala Thr 
1330 1335 1340 

Pro Asp Ser Ser Ala Pro Ser Ser Pro Lys Leu Gin His Gly Ser Thr 
1345 1350 1355 1360 

Glu Thr Ala Ser Pro Ser He Lys Ser Ser Thr Ser Ser Ser Val Gly 
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1365 1370 1375 

Thr Asp Val Thr Glu Gly Pro Ala His Pro Ala Pro His Thr Arg Leu 
1380 1385 1390 

Phe His Ala Asn Glu Glu Glu Glu Pro Glu Lys Lys Glu Val Ser Glu 
1395 1400 1405 

Leu Arg Ser Glu Leu Trp Glu Lys Glu Met Lys Leu Thr Asp lie Arg 
1410 1415 1420 

Leu Glu Ala Leu Asn Ser Ala His Gin Leu Asp Gin Leu Arg Glu Thr 
1425 1430 1435 1440 

Met His Asn Met Gin Leu Glu Val Asp Leu Leu Glu Ala Glu Asn Asp 
1445 1450 1455 



Arg Leu Lys Val Ala Pro Gly Pro Ser Ser Gly Ser Thr Pro Gly Gin 
1460 1465 1470 

Val Pro Gly Ser Ser Ala Leu Ser Ser Pro Arg Arg Ser Leu Gly Leu 
1475 1480 1485 

Ala Leu Thr His Ser Phe Gly Pro Ser Leu Ala Asp Thr Asp Leu Ser 
1490 1495 1500 

Pro Met Asp Gly lie Ser Thr Cys Gly Pro Lys Glu Glu Val Thr Leu 
1505 1510 1515 1520 

Arg Val Val Val Arg Met Pro Pro Gin His lie lie Lys Gly Asp Leu 
1525 1530 1535 

Lys Gin Gin Glu Phe Phe Leu Gly Cys Ser Lys Val Ser Gly Lys Val 
1540 1545 1550 

Asp Trp Lys Met Leu Asp Glu Ala Val Phe Gin Val Phe Lys Asp Tyr 
1555 1560 1565 

lie Ser Lys Met Asp Pro Ala Ser Thr Leu Gly Leu Ser Thr Glu Ser 
1570 1575 1580 



lie His Gly Tyr Ser lie Ser His 
1585 1590 



Ser Leu Lys Gly Leu Lys Glu Lys 
1620 



Val Lys Arg Val Leu Asp Ala Glu 
1595 1600 

Ser Val 
1615 

Cys Val Asp Ser Leu Val Phe Glu 
1625 1630 



Pro Pro Glu Met Pro Pro Cys Arg Arg Gly Val Asn Asn lie 
1605 1610 



Thr Leu lie Pro Lys Pro Met Met Gin His Tyr lie Ser Leu Leu Leu 
1635 1640 1645 

Lys His Arg Arg Leu Val Leu Ser Gly Pro Ser Gly Thr Gly Lys Thr 
1650 1655 1660 
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Tyr Leu Thr Asn Arg Leu Ala Glu Tyr Leu Val Glu Arg Ser Gly Arg 
1665 1670 1675 1680 



Glu Val Thr Glu Gly lie Val Ser Thr Phe Asn Met His Gin Gin Ser 
1685 1690 1695 

Cys Lys Asp Leu Gin Leu Tyr Leu Ser Asn Leu Ala Asn Gin lie Asp 
1700 1705 1710 

Arg Glu Thr Gly lie Gly Asp Val Pro Leu Val lie Leu Leu Asp Asp 
1715 1720 1725 

Leu Ser Glu Ala Gly Ser lie Ser Glu Leu Val Asn Gly Ala Leu Thr 
1730 1735 1740 

Cys Lys Tyr His Lys Cys Pro Tyr lie lie Gly Thr Thr Asn Gin Pro 
1745 1750 1755 1760 

Val Lys Met Thr Pro Asn His Gly Phe His Leu Ser Phe Arg Met Leu 
1765 1770 1775 

Thr Phe Ser Asn Asn Val Glu Pro Ala Asn Gly Phe Leu Val Arg Tyr 
1780 1785 1790 

Leu Arg Arg Lys Leu Val Glu Ser Asp Ser Asp lie Asn Ala Asn Lys 
1795 1800 1805 

Glu Glu Leu Leu Arg Val Leu Asp Trp Val Pro Lys Leu Trp Tyr His 
1810 1815 1820 

Leu His Thr Phe Leu Glu Lys His Ser Thr Ser Asp Phe Leu lie Gly 
1825 1830 1835 1840 

Pro Cys Phe Phe Leu Ser Cys Pro lie Gly lie Glu Asp Phe Arg Thr 
1845 1850 1855 

Trp Phe lie Asp Leu Trp Asn Asn Ser lie lie Pro Tyr Leu Gin Glu 
1860 1865 1870 

Gly Ala Lys Asp Gly lie Lys Val His Gly Gin Lys Ala Ala Trp Glu 
1875 1880 1885 

Asp Pro Val Glu Trp Val Arg Asp Thr Leu Pro Trp Pro Ser Ala Gin 
1890 1895 1900 

Gin Asp Gin Ser Lys Leu Tyr His Leu Pro Pro Pro Thr Val Gly Pro 
1905 1910 1915 1920 

His Ser lie Ala Ser Pro Pro Glu Asp Arg Thr Val Lys Asp Ser Thr 
1925 1930 1935 

Pro Ser Ser Leu Asp Ser Asp Pro Leu Met Ala Met Leu Leu Lys Leu 
1940 1945 1950 

Gin Glu Ala Ala Asn Tyr lie Glu Ser Pro Asp Arg Glu Thr lie Leu 
1955 1960 1965 
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Asp Pro Asn Leu Gin Ala Thr Leu 
1970 1975 



<210> 26 

<211> 7783 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<223> (2) ". . (7141) 

<400> 26 



gcc aaa tec ggc cac aag cgt etc ate agg gat etc cag caa gat gtg 
Ala Lys Ser Gly His Lys Arg Leu lie Arg Asp Leu Gin Gin Asp Val 



aag att gaa gac ate aat ggc tgt ccg aag aac aga tec caa atg att 
Lys lie Glu Asp lie Asn Gly Cys Pro Lys Asn Arg Ser Gin Met lie 



ate cag ggg ctg tct gca gaa gag ate agg aat gga aac etc aag gcc 
lie Gin Gly Leu Ser Ala Glu Glu lie Arg Asn Gly Asn Leu Lys Ala 



att eta ggc etc ttc ttc age etc tec cga tac aag cag cag cag cag 
lie Leu Gly Leu Phe Phe Ser Leu Ser Arg Tyr Lys Gin Gin Gin Gin 



cag ccc cag aag cag cac etc tec tea cct ctg ccg ccc gcc gta tec 
Gin Pro Gin Lys Gin His Leu Ser Ser Pro Leu Pro Pro Ala Val Ser 



46 



94 



c aac cag cca gaa cgc ctg aac teg cag gtg ctg cag ggg ctg cag 
Asn Gin Pro Glu Arg Leu Asn Ser Gin Val Leu Gin Gly Leu Gin 

gag cca gcg ggg gag ggg etc ccg ctg egg aag age ggc teg gtg gaa 
Glu Pro Ala Gly Glu Gly Leu Pro Leu Arg Lys Ser Gly Ser Val Glu 

aac ggg ttc gat ace cag ate tac aca gac tgg gcc aat cat tac eta 142 
Asn Gly Phe Asp Thr Gin lie Tyr Thr Asp Trp Ala Asn His Tyr Leu 



190 



aca gat ggc gtc etc ctg gcc cag att ate cag gtt gtg gca aat gaa 23 8 

Thr Asp Gly Val Leu Leu Ala Gin lie lie Gin Val Val Ala Asn Glu 



286 



g aa aac ata gat gcc tgc ttg aat ttc ctg gca get aag gga ata aac 334 
Glu Asn lie Asp Ala Cys Leu Asn Phe Leu Ala Ala Lys Gly lie Asn 



382 



430 



478 



69 



cag gtg gcc ggg gcc ccc tec cag tgc cag get ggc ace cct cag cag 
Gin Val Ala Gly Ala Pro Ser Gin Cys Gin Ala Gly Thr Pro Gin Gin 



526 



cag gtg cca gtc act ccc caa gcc ccg tgc cag cct cac cag cca gcg 
Gin Val Pro Val Thr Pro Gin Ala Pro Cys Gin Pro His Gin Pro Ala 



574 



cca cat cag cag tea aaa gca caa get gaa atg cag tec aga ctt cca 
Pro His Gin Gin Ser Lys Ala Gin Ala Glu Met Gin Ser Arg Leu Pro 



622 



ggt cct acc gcg agg gta tec get gca ggc age gag gcc aaa aca cgc 
Gly Pro Thr Ala Arg Val Ser Ala Ala Gly Ser Glu Ala Lys Thr Arg 



670 



gga ggg tea act act get aac aac cga cgc age cag age ttt aac aac 
Gly Gly Ser Thr Thr Ala Asn Asn Arg Arg Ser Gin Ser Phe Asn Asn 



718 



tat gat aaa tec aaa cca gtc acc tec cca ccc cca ccg cca age age 
Tyr Asp Lys Ser Lys Pro Val Thr Ser Pro Pro Pro Pro Pro Ser Ser 



766 



cac gag aaa gag cct ttg gca agt tea gcc tec tec cac ccc gga atg 
His Glu Lys Glu Pro Leu Ala Ser Ser Ala Ser Ser His Pro Gly Met 



814 



agt gac aat gca cct get tec ttg gag age ggc age age tec acc cct 
Ser Asp Asn Ala Pro Ala Ser Leu Glu Ser Gly Ser Ser Ser Thr Pro 



862 



act aat tgc agt acc tec teg gcc ate ccg cag ccc ggt gca gcc acc 
Thr Asn Cys Ser Thr Ser Ser Ala lie Pro Gin Pro Gly Ala Ala Thr 



910 



aag cct tgg cgc age aaa tec etc age gtg aag cac agt gcc acg gta 
Lys Pro Trp Arg Ser Lys Ser Leu Ser Val Lys His Ser Ala Thr Val 



958 



tec atg etc teg gtc aag cct cct ggg cct gag gcc ccc agg ccc aca 
Ser Met Leu Ser Val Lys Pro Pro Gly Pro Glu Ala Pro Arg Pro Thr 



1006 



cct gaa gcc atg aag ccg gcc ccc aac aat cag aag tec atg ctg gaa 
Pro Glu Ala Met Lys Pro Ala Pro Asn Asn Gin Lys Ser Met Leu Glu 



1054 



aag ctg aaa ctt ttc aac agt aaa ggg ggc tea aag gca ggt gag ggg 
Lys Leu Lys Leu Phe Asn Ser Lys Gly Gly Ser Lys Ala Gly Glu Gly 



1102 



ccg ggg tec egg gac aca age tgt gag egg ctg gag act ctg ccc age 
Pro Gly Ser Arg Asp Thr Ser Cys Glu Arg Leu Glu Thr Leu Pro Ser 



1150 



70 
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ttc gaa gag age gag gag 
Phe Glu Glu Ser Glu Glu 



9tg ggc cct get tec age 
Val Gly Pro Ala Ser Ser 



cag agg act ttt age egg 
Gin Arg Thr Phe Ser Arg 



ggc aat gag aaa gag aag 
Gly Asn Glu Lys Glu Lys 



aaa age aag gac ctt gee 
Lys Ser Lys Asp Leu Ala 



etc aag gag gag cca aaa 
Leu Lys Glu Glu Pro Lys 



atg cca aaa aag tec tec 
Met Pro Lys Lys Ser Ser 



aag etc aac agt gee aag 
Lys Leu Asn Ser Ala Lys 



ata cca aaa cca gga atg 
lie Pro Lys Pro Gly Met 



cca gcg cct tec aag gaa 
Pro Ala Pro Ser Lys Glu 



tea gga etc ccc cag cag 
Ser Gly Leu Pro Gin Gin 



tec tct tec age ctg gcg 
Ser Ser Ser Ser Leu Ala 



acc ctg aac cac age ate 
Thr Leu Asn His Ser lie 



acc acc cag acc aca gga 
Thr Thr Gin Thr Thr Gly 



ctg gag gec gee agt cgc 
Leu Glu Ala Ala Ser Arg 



age ccc aag att gca etc 
Ser Pro Lys lie Ala Leu 



gca ctg acc aac aag aag 
Ala Leu Thr Asn Lys Lys 



gag aaa caa cag egg gag 
Glu Lys Gin Gin Arg Glu 



aag aga gee tct gtg acg 
Lys Arg Ala Ser Val Thr 



gaa gac ccc agt gga gca 
Glu Asp Pro Ser Gly Ala 



aag att gee age ttc ate 
Lys lie Ala Ser Phe lie 



aag gag ccc atg gee cct 
Lys Glu Pro Met Ala Pro 



aag age atg ccc ggg aaa 
Lys Ser Met Pro Gly Lys 



ggg gag egg age egg agt 
Gly Glu Arg Ser Arg Ser 



aag ccc cag ctg gac ggc 
Lys Pro Gin Leu Asp Gly 



tec tea gaa gga aaa ggc 
Ser Ser Glu Gly Lys Gly 



age age cag act gtc agt 
Ser Ser Gin Thr Val Ser 



age aat acc gtc agt gtt 
Ser Asn Thr Val Ser Val 



atg etc acc acc 1198 
Met Leu Thr Thr 



aag ggc att gee 1246 
Lys Gly lie Ala 



agt tct ctg aaa 1294 
Ser Ser Leu Lys 



aag gat aag gag 1342 
Lys Asp Lys Glu 



gag agg ctg gac 13 90 

Glu Arg Leu Asp 



get gtg ccc gag 1438 
Ala Val Pro Glu 



ccc aaa ggg ggg 14 8 6 

Pro Lys Gly Gly 



tec cac agt gga 1534 
Ser His Ser Gly 



tec cca agt gec 1582 
Ser Pro Ser Ala 



ggg aag ctg age 1630 
Gly Lys Leu Ser 



aga cac tec agt 1678 
Arg His Ser Ser 



cca gga ggg acc 1726 
Pro Gly Gly Thr 



ggg tct gtc ggg 1774 
Gly Ser Val Gly 



cag eta cct cag 1822 
Gin Leu Pro Gin 
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ccc cag cag caa tac aac cat ccc aac 
Pro Gin Gin Gin Tyr Asn His Pro Asn 



ctg tac agg tct cag acg gac act gaa 
Leu Tyr Arg Ser Gin Thr Asp Thr Glu 



age tea aca ggt gtg age gtg gag ccc 
Ser Ser Thr Gly Val Ser Val Glu Pro 



cag cct get ctg gaa gaa etc act ggg 
Gin Pro Ala Leu Glu Glu Leu Thr Gly 



ctg egg aca gtg aag aac ate get gat 
Leu Arg Thr Val Lys Asn lie Ala Asp 



acc atg tec agt tta agg gga act cag 
Thr Met Ser Ser Leu Arg Gly Thr Gin 



acc acg ttt gac acc aat gtc acc acg 
Thr Thr Phe Asp Thr Asn Val Thr Thr 



etc age ttg aca ggg agg ccc aca cct 
Leu Ser Leu Thr Gly Arg Pro Thr Pro 



tec age cct egg etc caa gca gga gac 
Ser Ser Pro Arg Leu Gin Ala Gly Asp 



tat ccc cct cga gee aac gee age agg 
Tyr Pro Pro Arg Ala Asn Ala Ser Arg 



cgc tat gtg tac tec gec cct ctg aga 
Arg Tyr Val Tyr Ser Ala Pro Leu Arg 



agt agt gtc tgc cac gtg gac gtc tea 
Ser Ser Val Cys His Val Asp Val Ser 



gac ctg gaa ggc ate age atg gac gee 
Asp Leu Glu Gly lie Ser Met Asp Ala 



gat gtt ctg age aag aac ate egg acc 
Asp Val Leu Ser Lys Asn lie Arg Thr 



atg act gat ggt gga ctt ggc etc tat 



act gec acg gtt gca cct ttc 1870 
Thr Ala Thr Val Ala Pro Phe 



ggg aat gtt act gec gag tea 1918 
Gly Asn Val Thr Ala Glu Ser 



age cac ttc acc aag act gga 1966 
Ser His Phe Thr Lys Thr Gly 



gaa gat cct gag get egg egg 2 014 

Glu Asp Pro Glu Ala Arg Arg 



ctg egg cag aat ttg gag gaa 2062 
Leu Arg Gin Asn Leu Glu Glu 



gtt aca cac age aca ttg gaa 2110 
Val Thr His Ser Thr Leu Glu 



gag atg agt ggc cgt age ata 2158 
Glu Met Ser Gly Arg Ser lie 



ctg tec tgg aga ctg ggc cag 2206 
Leu Ser Trp Arg Leu Gly Gin 



gee ccc tea atg ggc aat ggg 2254 
Ala Pro Ser Met Gly Asn Gly 



ttc ate aac act gag tea ggt 2302 
Phe lie Asn Thr Glu Ser Gly 



agg cag ctg gee tec egg ggc 2 3 50 

Arg Gin Leu Ala Ser Arg Gly 



gac aag gca gga gat gag atg 2 3 98 

Asp Lys Ala Gly Asp Glu Met 



ccc ggc tac atg age gat ggg 2446 
Pro Gly Tyr Met Ser Asp Gly 



gat gac att aca age gga tac 24 94 

Asp Asp lie Thr Ser Gly Tyr 



acc cgt cgc ctg aac egg etc 2542 
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Met Thr Asp Gly Gly Leu Gly Leu Tyr Thr Arg Arg Leu Asn Arg Leu 



cct gat ggg atg get gtg gta egg gag acc ctg caa cga aat acc tec 2590 
Pro Asp Gly Met Ala Val Val Arg Glu Thr Leu Gin Arg Asn Thr Ser 



ctg ggc etc gga gac get gac age tgg gac gac age age tec gtc age 2 63 8 

Leu Gly Leu Gly Asp Ala Asp Ser Trp Asp Asp Ser Ser Ser Val Ser 



age ggc ate age gac acc ata gac aac etc age act gat gac ate aac 2 686 

Ser Gly lie Ser Asp Thr lie Asp Asn Leu Ser Thr Asp Asp lie Asn 



acc age tec tec ate age tct tat gee aac aca cct gee tec tct cga 2734 
Thr Ser Ser Ser lie Ser Ser Tyr Ala Asn Thr Pro Ala Ser Ser Arg 



aaa aac ctg gat gtg cag act gat get gag aag cac tea cag gtg gag 2782 
Lys Asn Leu Asp Val Gin Thr Asp Ala Glu Lys His Ser Gin Val Glu 



agg aat tec ctg tgg tct ggt gat gat gtc aag aaa tea gac gga ggc 2 83 0 

Arg Asn Ser Leu Trp Ser Gly Asp Asp Val Lys Lys Ser Asp Gly Gly 



tea gac age ggc ata aaa atg gag cca ggt tec aag tgg agg egg aat 2 878 

Ser Asp Ser Gly lie Lys Met Glu Pro Gly Ser Lys Trp Arg Arg Asn 



cct tct gat gtg tct gac gag tec gac aaa age acg teg ggc aag aag 2926 
Pro Ser Asp Val Ser Asp Glu Ser Asp Lys Ser Thr Ser Gly Lys Lys 



aat cct gtc ate tec cag aca ggc tea tgg egg cga ggc atg aca get 2 974 

Asn Pro Val lie Ser Gin Thr Gly Ser Trp Arg Arg Gly Met Thr Ala 



cag gtg ggc ate acc atg cca agg acg aag get tea gee ccg gca ggc 3022 
Gin Val Gly lie Thr Met Pro Arg Thr Lys Ala Ser Ala Pro Ala Gly 



gca ctg aag acc cca gga act gga aaa aca gac gac gca aag gtg tct 3070 
Ala Leu Lys Thr Pro Gly Thr Gly Lys Thr Asp Asp Ala Lys Val Ser 



gag aaa gga agg ctt tct cct aaa gee tec cag gtg aag cgc tec cca 3118 
Glu Lys Gly Arg Leu Ser Pro Lys Ala Ser Gin Val Lys Arg Ser Pro 



tea gat gca ggc egg age agt ggt gac gaa tec aaa aag ccc etc ccc 3166 
Ser Asp Ala Gly Arg Ser Ser Gly Asp Glu Ser Lys Lys Pro Leu Pro 



age age tct agg aca cct act gee aat gec aac age ttt ggg ttc aag 3214 
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Ser Ser Ser Arg Thr Pro Thr Ala Asn Ala Asn Ser Phe Gly Phe Lys 

aag cag agt ggt tec gec acc ggc ctg gec atg ate aca gec age ggg 3262 
Lys Gin Ser Gly Ser Ala Thr Gly Leu Ala Met lie Thr Ala Ser Gly 

gtg act gtc acc age agg tea gee aca ctg ggc aaa ate cca aag tea 3310 
Val Thr Val Thr Ser Arg Ser Ala Thr Leu Gly Lys lie Pro Lys Ser 

tct gca etc gtc agt egg tct get ggt egg aag tea agt atg gat ggg 3358 
Ser Ala Leu Val Ser Arg Ser Ala Gly Arg Lys Ser Ser Met Asp Gly 

get cag aat cag gat gac ggg tat eta gec eta age tec egg aca aac 3406 
Ala Gin Asn Gin Asp Asp Gly Tyr Leu Ala Leu Ser Ser Arg Thr Asn 

ctt cag tac egg agt ttg ccg agg ccc agt aag tec aac age egg aac 3454 
Leu Gin Tyr Arg Ser Leu Pro Arg Pro Ser Lys Ser Asn Ser Arg Asn 

ggg get ggg aac agg tct age acc age age ata gat tec aac att age 3502 
Gly Ala Gly Asn Arg Ser Ser Thr Ser Ser lie Asp Ser Asn lie Ser 

age aag tec gca ggc ctg cca gtg ccc aaa ctg agg gag cct tec aaa 3550 
Ser Lys Ser Ala Gly Leu Pro Val Pro Lys Leu Arg Glu Pro Ser Lys 

aca gec eta ggc age tct eta cca ggt ctg gtc aac caa aca gac aag 3598 
Thr Ala Leu Gly Ser Ser Leu Pro Gly Leu Val Asn Gin Thr Asp Lys 

gag aaa ggc ate tea tea gac aac gag agt gtg get tec tgt aac teg 3 646 

Glu Lys Gly lie Ser Ser Asp Asn Glu Ser Val Ala Ser Cys Asn Ser 

gtg aaa gtg aat ccg gca gec cag cct gtg tec agt ccg get cag acc 3694 
Val Lys Val Asn Pro Ala Ala Gin Pro Val Ser Ser Pro Ala Gin Thr 

agt etc cag cct gga gec aag tac cca gat gtg gec tct ccc aca etc 3742 
Ser Leu Gin Pro Gly Ala Lys Tyr Pro Asp Val Ala Ser Pro Thr Leu 

cgc aga etc ttt ggt ggg aag cct acc aag caa gtg ccc ate gec aca 3790 
Arg Arg Leu Phe Gly Gly Lys Pro Thr Lys Gin Val Pro lie Ala Thr 

get gaa aac atg aaa aat teg gtg gtc ate tec aat cct cat gec acc 3838 
Ala Glu Asn Met Lys Asn Ser Val Val lie Ser Asn Pro His Ala Thr 

atg act cag caa ggt aac eta gac tec ccg tea ggc agt ggc gtc ctg 3886 
Met Thr Gin Gin Gly Asn Leu Asp Ser Pro Ser Gly Ser Gly Val Leu 
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age agt ggg age 
Ser Ser Gly Ser 



cag tct ccg eta 
Gin Ser Pro Leu 



age agt cct etc 
Ser Ser Pro Leu 



gec tec age ccc 
Ala Ser Ser Pro 



tac age aag aat 
Tyr Ser Lys Asn 



age tea gee cac 
Ser Ser Ala His 



gtg gac etc aac 
Val Asp Leu Asn 



teg gee cct tec 
Ser Ala Pro Ser 



aac age etc ace tgg ggc ace aac gee age age tec tec gca gtt age 
Asn Ser Leu Thr Trp Gly Thr Asn Ala Ser Ser Ser Ser Ala Val Ser 



aag gat ggc ctg ggc ttt cag tct gtc age age etc cac acc age tgt 
Lys Asp Gly Leu Gly Phe Gin Ser Val Ser Ser Leu His Thr Ser Cys 



gag tec ate gac ate tec etc age agt gga ggg gtc ccc age cac aat 
Glu Ser lie Asp lie Ser Leu Ser Ser Gly Gly Val Pro Ser His Asn 



tct tec act ggc etc ate gee tec tec aag gac gac tec ttg act ccc 
Ser Ser Thr Gly Leu lie Ala Ser Ser Lys Asp Asp Ser Leu Thr Pro 



ttt gtc aga act aac agt gtg aag acc aca ctg tea gaa age cct etc 
Phe Val Arg Thr Asn Ser Val Lys Thr Thr Leu Ser Glu Ser Pro Leu 



tct tec cct get get age cct aag ttc tgc aga agt act ctg ccc agg 
Ser Ser Pro Ala Ala Ser Pro Lys Phe Cys Arg Ser Thr Leu Pro Arg 



aaa cag gac agt gac ccg cac ctt gat agg aac act ttg cct aag aaa 
Lys Gin Asp Ser Asp Pro His Leu Asp Arg Asn Thr Leu Pro Lys Lys 



gga etc agg tat act ccc acc tec cag ctt cgc acg caa gaa gat gca 
Gly Leu Arg Tyr Thr Pro Thr Ser Gin Leu Arg Thr Gin Glu Asp Ala 



aaa gaa tgg tta egg tec cat tct gca gga ggc ctt cag gac acc get 
Lys Glu Trp Leu Arg Ser His Ser Ala Gly Gly Leu Gin Asp Thr Ala 



gee aat tec ccc ttt tec tct ggc tec age gtg act tct ccc tec gga 
Ala Asn Ser Pro Phe Ser Ser Gly Ser Ser Val Thr Ser Pro Ser Gly 



aca aga ttc aac ttt tec cag ctt gcg agt ccc acc act gtc acc cag 
Thr Arg Phe Asn Phe Ser Gin Leu Ala Ser Pro Thr Thr Val Thr Gin 



atg age ttg tec aac ccg acc atg ctg agg act cac age etc tec aat 
Met Ser Leu Ser Asn Pro Thr Met Leu Arg Thr His Ser Leu Ser Asn 



3934 



3982 



4030 



4078 



4126 



4174 



4222 



4270 



4318 



4366 



4414 



4462 



4510 



4558 
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get gat ggg cag tat gat cca tac act 
Ala Asp Gly Gin Tyr Asp Pro Tyr Thr 



tec atg tec ctg gat gag aag age aga 
Ser Met Ser Leu Asp Glu Lys Ser Arg 



ttc egg gat ggg ttt gaa gaa gtt cat 
Phe Arg Asp Gly Phe Glu Glu Val His 



tec age aca teg tea gtt tat tct aca 
Ser Ser Thr Ser Ser Val Tyr Ser Thr 



gag att cgc aag ctg egg egg gaa ctg 
Glu lie Arg Lys Leu Arg Arg Glu Leu 



tea get ttg ace ace cag ctg aca gca 
Ser Ala Leu Thr Thr Gin Leu Thr Ala 



ttt gaa cag agt ctt ggt aac atg aca 
Phe Glu Gin Ser Leu Gly Asn Met Thr 



atg aca get gag cag aag gat tea gaa 
Met Thr Ala Glu Gin Lys Asp Ser Glu 



att gag ctg eta aag aaa cag aac gca 
lie Glu Leu Leu Lys Lys Gin Asn Ala 



gga gta att aac aca cct gag etc aac 
Gly Val lie Asn Thr Pro Glu Leu Asn 



cag tct gca gac etc cgc ate cgc agg 
Gin Ser Ala Asp Leu Arg lie Arg Arg 



tec age ate aac agt gee ace age cac 
Ser Ser lie Asn Ser Ala Thr Ser His 



gag agt gac tea aag aag aag aag agg 
Glu Ser Asp Ser Lys Lys Lys Lys Arg 



cgc age tec ttc aag caa get ttc ggg 
Arg Ser Ser Phe Lys Gin Ala Phe Gly 
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gac age cgc ttc egg aat age 4606 
Asp Ser Arg Phe Arg Asn Ser 



acc atg age cgt tea ggc tea 4654 
Thr Met Ser Arg Ser Gly Ser 



gga tec tea etc tec ttg gtt 4702 
Gly Ser Ser Leu Ser Leu Val 



cca gaa gaa aaa tgc cag tea 4750 
Pro Glu Glu Lys Cys Gin Ser 



gat gee tec cag gag aaa gtt 4798 
Asp Ala Ser Gin Glu Lys Val 



aat get cac ctt gtg get gee 4846 
Asn Ala His Leu Val Ala Ala 



ate agg etc cag agt ctg acc 4 894 

lie Arg Leu Gin Ser Leu Thr 



ctg aat gag tta aga aaa acc 4 942 

Leu Asn Glu Leu Arg Lys Thr 



get gee cag get gee att aat 4990 
Ala Ala Gin Ala Ala lie Asn 



tgc aaa gga aac ggc act gee 5038 
Cys Lys Gly Asn Gly Thr Ala 



cag cac tec tea gac age gtc 5086 
Gin His Ser Ser Asp Ser Val 



tec agt gtg ggc age aac ata 5134 
Ser Ser Val Gly Ser Asn lie 



aag aac tgg gtc aat gag tta 5182 
Lys Asn Trp Val Asn Glu Leu 

aag aag aag tec cca aaa tct 5230 
Lys Lys Lys Ser Pro Lys Ser 
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gcg tec tct cat tea gat att gag gag atg acg gat tct tct ttg cct 5278 

Ala Ser Ser His Ser Asp lie Glu Glu Met Thr Asp Ser Ser Leu Pro 

tec tea cca aag tta cca cac aat ggg tec aca ggt tec acc cca ctg 5326 

Ser Ser Pro Lys Leu Pro His Asn Gly Ser Thr Gly Ser Thr Pro Leu 

ctg agg aat tct cac tec aac tct eta att tea gaa tgc atg gat agt 5374 

Leu Arg Asn Ser His Ser Asn Ser Leu lie Ser Glu Cys Met Asp Ser 

gaa get gag acc gtc atg cag etc cga aat gag tta aga gac aag gag 5422 

Glu Ala Glu Thr Val Met Gin Leu Arg Asn Glu Leu Arg Asp Lys Glu 



atg aag ctg aca gat ate cgc tta gaa get etc agt tct gec cac cag 
Met Lys Leu Thr Asp lie Arg Leu Glu Ala Leu Ser Ser Ala His Gin 



aag tgg aag gag gat tec aga cca cat etc ttt ctt att ggc tgc att 
Lys Trp Lys Glu Asp Ser Arg Pro His Leu Phe Leu lie Gly Cys lie 



egg ctg ttc aaa gaa tap ate att cat gtc gac cca gtg agt cag eta 
Arg Leu Phe Lys Glu Tyr lie lie His Val Asp Pro Val Ser Gin Leu 



5470 



ctg gac cag etc egg gag gee atg aac agg atg cag agt gaa ata gag 5518 
Leu Asp Gin Leu Arg Glu Ala Met Asn Arg Met Gin Ser Glu lie Glu 

aag ctg aaa get gag aat gat egg ctg aag tea gag tct caa ggc agt 5566 
Lys Leu Lys Ala Glu Asn Asp Arg Leu Lys Ser Glu Ser Gin Gly Ser 

ggc tgc age egg get cct tec caa gtg tec ate tct gee tec ccg agg 5614 
Gly Cys Ser Arg Ala Pro Ser Gin Val Ser lie Ser Ala Ser Pro Arg 

cag tec atg ggc etc tec cag cac age ttg aac etc act gag tea acc 5662 
Gin Ser Met Gly Leu Ser Gin His Ser Leu Asn Leu Thr Glu Ser Thr 

age ctg gac atg ttg ctg gat gac act ggt gaa tgc teg get egg aag 5710 
Ser Leu Asp Met Leu Leu Asp Asp Thr Gly Glu Cys Ser Ala Arg Lys 

gaa gga ggc agg cat gtt aag ata gtt gtc age ttt cag gag gaa atg 5758 
Glu Gly Gly Arg His Val Lys lie Val Val Ser Phe Gin Glu Glu Met 
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gga gtt agt ggc aag acg aag tgg gat gtg etc gat ggg gtg gtt aga 5854 
Gly Val Ser Gly Lys Thr Lys Trp Asp Val Leu Asp Gly Val Val Arg 



5902 



ggg ctg aat tea gac age gtt ctt ggc tac age att gga gaa ate aag 5 95 0 

Gly Leu Asn Ser Asp Ser Val Leu Gly Tyr Ser lie Gly Glu lie Lys 
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cgc age aac act tec gaa aca ccg gag ctg ctt cct tgt ggc tat ctg 5998 

Arg Ser Asn Thr Ser Glu Thr Pro Glu Leu Leu Pro Cys Gly Tyr Leu 

9tt gga gag aac acg acc ate tea gtg act gtg aaa ggg etc gca gaa 6046 

Val Gly Glu Asn Thr Thr lie Ser Val Thr Val Lys Gly Leu Ala Glu 

aac age ctg gac tea ctg gtg ttt gag tec ttg att ccc aag ccc ate 6094 

Asn Ser Leu Asp Ser Leu Val Phe Glu Ser Leu lie Pro Lys Pro lie 

ctg cag cgc tac gtc tec etc ctg ata gag cac cgt egg ate att etc 6142 

Leu Gin Arg Tyr Val Ser Leu Leu lie Glu His Arg Arg lie lie Leu 

tct 99 c ccc a 9 c 99 c act 999 aaa acc tac ct 9 9 CC aac c 99 ct 9 tct 6190 

Ser Gly Pro Ser Gly Thr Gly Lys Thr Tyr Leu Ala Asn Arg Leu Ser 

gag tat ata gtg ctt cga gag gga egg gag ttg aca gac ggg gtt ate 6238 

Glu Tyr lie Val Leu Arg Glu Gly Arg Glu Leu Thr Asp Gly Val lie 

gee acc ttt aac gtg gac cat aag tec age aag gaa ttg cgc cag tac 6286 

Ala Thr Phe Asn Val Asp His Lys Ser Ser Lys Glu Leu Arg Gin Tyr 

ctg tec aac ctt get gac cag tgc aac agt gag aac aat get gtg gac 6334 

Leu Ser Asn Leu Ala Asp Gin Cys Asn Ser Glu Asn Asn Ala Val Asp 

atg ccc etc gtc ate ate ctg gac aac eta cac cac gtg age tct ctg 6382 

Met Pro Leu Val lie lie Leu Asp Asn Leu His His Val Ser Ser Leu 

ggc gag ate ttc aat ggg ctg etc aac tgc aag tac cac aaa tgc cct 6430 

Gly Glu lie Phe Asn Gly Leu Leu Asn Cys Lys Tyr His Lys Cys Pro 

tac ata att ggc aca atg aac cag get acc tct teg act ccc aac ctg 6478 

Tyr lie lie Gly Thr Met Asn Gin Ala Thr Ser Ser Thr Pro Asn Leu 

cag ctt cac cat aac ttc aga tgg gtg ctt tgt gee aac cac acg gag 6526 

Gin Leu His His Asn Phe Arg Trp Val Leu Cys Ala Asn His Thr Glu 

cct gtg aag ggt ttc ctt ggc cga ttc ctg agg agg aag etc atg gaa 6574 

Pro Val Lys Gly Phe Leu Gly Arg Phe Leu Arg Arg Lys Leu Met Glu 

aca gag ate agt ggg egg gtg cgc aat atg gag ctg gta aaa ate att 662 2 

Thr Glu lie Ser Gly Arg Val Arg Asn Met Glu Leu Val Lys lie lie 
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gac tgg att ccc aag gtc tgg cat cac etc aac cgc ttc ctg gag get 
Asp Trp lie Pro Lys Val Trp His His Leu Asn Arg Phe Leu Glu Ala 



cac agt tec teg gac gtc ace ate ggc ccc egg etc ttc ctg tea tgc 
His Ser Ser Ser Asp Val Thr lie Gly Pro Arg Leu Phe Leu Ser Cys 



tac tec age ccc cag age tat gac age gac tec aac age aac age cat 
Tyr Ser Ser Pro Gin Ser Tyr Asp Ser Asp Ser Asn Ser Asn Ser His 



6670 



6718 



ccc ate gat gtg gac ggc teg aga gtg tgg ttc ace gac ttg tgg aac 67 66 

Pro lie Asp Val Asp Gly Ser Arg Val Trp Phe Thr Asp Leu Trp Asn 

tat tec att ate ccc tat etc ctg gaa gee gtc aga gaa gga etc cag 6814 
Tyr Ser lie lie Pro Tyr Leu Leu Glu Ala Val Arg Glu Gly Leu Gin 

etc tat gga agg cgc gee ccc tgg gag gat cct gee aag tgg gtg atg 6862 
Leu Tyr Gly Arg Arg Ala Pro Trp Glu Asp Pro Ala Lys Trp Val Met 

gac aca tat cca tgg gca gee age cca caa cag cac gag tgg cct ccc 
Asp Thr Tyr Pro Trp Ala Ala Ser Pro Gin Gin His Glu Trp Pro Pro 

ctg ctg cag tta egg cct gag gat gtc ggc ttc gac ggc tac tec atg 
Leu Leu Gin Leu Arg Pro Glu Asp Val Gly Phe Asp Gly Tyr Ser Met 

cct egg gag gga teg aca age aag cag atg ccc ccc agt gat get gaa 
Pro Arg Glu Gly Ser Thr Ser Lys Gin Met Pro Pro Ser Asp Ala Glu 

ggt gac ccg ctg atg aac atg ctg atg agg ctg cag gag gca gee aac 7054 
Gly Asp Pro Leu Met Asn Met Leu Met Arg Leu Gin Glu Ala Ala Asn 
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6958 



7006 



7102 



cac gat gac ate ttg gac tec tct ttg gag tec act ctg tgacaggggc 7151 
His Asp Asp lie Leu Asp Ser Ser Leu Glu Ser Thr Leu 

ccggagccca gcgccctcct cttctcctca ccgcattcca cctgcatccc ccacatcacc 7211 

ctgaagatga cttcctgagc cagcccccag ccacagcctt agagctgegg gaacaccgag 7271 

accccccgtc cttcagcctc gacctgggtg caggcatccc gggccagctg cctgcggacc 7331 

gcttccttcc acagegagaa ctgcactacc ttctgttgta ctttaattat tgttttgcct 7391 

tgttgctgtg acctccctaa gacactgaag atacttctcg ggaaaggatc atcgccgttg 7451 

aaatgaaaag agagacagag agagaaaaaa aaaagagaac ccacatgaag ctctgaaacc 7511 
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aaacagcatc ctgccatgag cttcccagag acagaagaga ctggagcaaa gtcggaaaca 7571 

cagagaagca cggcttcccc tcagcacaga ccctccagac tgggtctcag agccgtgcca 7631 

cccaccctcc cacacagccg gccacaggga gaactggtgc taaccagggt gcttgctttg 7691 

gtcacgttca acgcactaca gagctacgac acaggggaac cttaggagca aataaaccgt 7751 

gctttcatgt tttttaaaaa aaaaaaaaaa aa 7783 



<210> 27 

<211> 2380 

<212> PRT 

<213> Homo sapiens 

<400> 27 



Asn Gin Pro Glu Arg Leu Asn Ser Gin Val Leu Gin Gly Leu Gin Glu 
15 10 15 

Pro Ala Gly Glu Gly Leu Pro Leu Arg Lys Ser Gly Ser Val Glu Asn 
20 25 30 

Gly Phe Asp Thr Gin lie Tyr Thr Asp Trp Ala Asn His Tyr Leu Ala 
35 40 45 

Lys Ser Gly His Lys Arg Leu lie Arg Asp Leu Gin Gin Asp Val Thr 
50 55 60 

Asp Gly Val Leu Leu Ala Gin lie lie Gin Val Val Ala Asn Glu Lys 
65 70 75 80 

lie Glu Asp lie Asn Gly Cys Pro Lys Asn Arg Ser Gin Met lie Glu 
85 90 95 

Asn He Asp Ala Cys Leu Asn Phe Leu Ala Ala Lys Gly He Asn lie 
100 105 110 

Gin Gly Leu Ser Ala Glu Glu He Arg Asn Gly Asn Leu Lys Ala He 
115 120 125 

Leu Gly Leu Phe Phe Ser Leu Ser Arg Tyr Lys Gin Gin Gin Gin Gin 
130 135 140 

Pro Gin Lys Gin His Leu Ser Ser Pro Leu Pro Pro Ala Val Ser Gin 
145 150 155 160 

Val Ala Gly Ala Pro Ser Gin Cys Gin Ala Gly Thr Pro Gin Gin Gin 
165 170 175 

Val Pro Val Thr Pro Gin Ala Pro Cys Gin Pro His Gin Pro Ala Pro 
180 185 - 190 

His Gin Gin Ser Lys Ala Gin Ala Glu Met Gin Ser Arg Leu Pro Gly 
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195 

Pro Thr Ala Arg 
210 

Gly Ser Thr Thr 
225 

Asp Lys Ser Lys 



Glu Lys Glu Pro 
260 

Asp Asn Ala Pro 
275 

Asn Cys Ser Thr 
290 

Pro Trp Arg Ser 
305 

Met Leu Ser Val 



Glu Ala Met Lys 
340 



200 

Val Ser Ala Ala 
215 

Ala Asn Asn Arg 
230 

Pro Val Thr Ser 
245 

Leu Ala Ser Ser 



Ala Ser Leu Glu 
280 

Ser Ser Ala lie 
295 

Lys Ser Leu Ser 
310 

Lys Pro Pro Gly 
325 

Pro Ala Pro Asn 



Gly Ser Glu Ala 
220 

Arg Ser Gin Ser 
235 

Pro Pro Pro Pro 
250 

Ala Ser Ser His 
265 

Ser Gly Ser Ser 



Pro Gin Pro Gly 
300 

Val Lys His Ser 
315 

Pro Glu Ala Pro 
330 

Asn Gin Lys Ser 
345 



205 

Lys Thr Arg Gly 



Phe Asn Asn Tyr 
240 

Pro Ser Ser His 
255 

Pro Gly Met Ser 
270 

Ser Thr Pro Thr 
285 

Ala Ala Thr Lys 



Ala Thr Val Ser 
320 

Arg Pro Thr Pro 
335 

Met Leu Glu Lys 
350 



Leu Lys Leu Phe 
355 

Gly Ser Arg Asp 
370 

Glu Glu Ser Glu 
385 

Gly Pro Ala Ser 



Arg Thr Phe Ser 
420 

Asn Glu Lys Glu 
435 

Ser Lys Asp Leu 
450 

Lys Glu Glu Pro 
465 

Pro Lys Lys Ser 



Asn Ser Lys Gly 
360 

Thr Ser Cys Glu 
375 

Glu Leu Glu Ala 
390 

Ser Ser Pro Lys 
405 

Arg Ala Leu Thr 



Lys Glu Lys Gin 
440 

Ala Lys Arg Ala 
455 

Lys Glu Asp Pro 
470 

Ser Lys lie Ala 
485 



Gly Ser Lys Ala 



Arg Leu Glu Thr 
380 

Ala Ser Arg Met 
395 

lie Ala Leu Lys 
410 

Asn Lys Lys Ser 
425 

Gin Arg Glu Lys 



Ser Val Thr Glu 
460 

Ser Gly Ala Ala 
475 

Ser Phe lie Pro 
490 



Gly Glu Gly Pro 
365 

Leu Pro Ser Phe 



Leu Thr Thr Val 
400 

Gly lie Ala Gin 
415 

Ser Leu Lys Gly 
430 

Asp Lys Glu Lys 
445 

Arg Leu Asp Leu 



Val Pro Glu Met 
480 

Lys Gly Gly Lys 
495 
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Leu Asn Ser Ala Lys Lys Glu Pro Met Ala Pro Ser His Ser Gly lie 
500 505 510 

Pro Lys Pro Gly Met Lys Ser Met Pro Gly Lys Ser Pro Ser Ala Pro 
515 520 525 

Ala Pro Ser Lys Glu Gly Glu Arg Ser Arg Ser Gly Lys Leu Ser Ser 
530 535 540 

Gly Leu Pro Gin Gin Lys Pro Gin Leu Asp Gly Arg His Ser Ser Ser 
545 550 555 560 

Ser Ser Ser Leu Ala Ser Ser Glu Gly Lys Gly Pro Gly Gly Thr Thr 
565 570 575 

Leu Asn His Ser lie Ser Ser Gin Thr Val Ser Gly Ser Val Gly Thr 
580 585 590 

Thr Gin Thr Thr Gly Ser Asn Thr Val Ser Val Gin Leu Pro Gin Pro 
595 600 605 

Gin Gin Gin Tyr Asn His Pro Asn Thr Ala Thr Val Ala Pro Phe Leu 
610 615 620 

Tyr Arg Ser Gin Thr Asp Thr Glu Gly Asn Val Thr Ala Glu Ser Ser 
625 630 635 640 

Ser Thr Gly Val Ser Val Glu Pro Ser His Phe Thr Lys Thr Gly Gin 
645 650 655 

Pro Ala Leu Glu Glu Leu Thr Gly Glu Asp Pro Glu Ala Arg Arg Leu 
660 665 670 

Arg Thr Val Lys Asn lie Ala Asp Leu Arg Gin Asn Leu Glu Glu Thr 
675 680 685 

Met Ser Ser Leu Arg Gly Thr Gin Val Thr His Ser Thr Leu Glu Thr 
690 695 700 



Thr Phe Asp Thr 
705 

Ser Leu Thr Gly 



Ser Pro Arg Leu 
740 

Pro Pro Arg Ala 
755 

Tyr Val Tyr Ser 
770 

Ser Val Cys His 
785 



Asn Val Thr Thr 
710 

Arg Pro Thr Pro 
725 

Gin Ala Gly Asp 



Asn Ala Ser Arg 
760 

Ala Pro Leu Arg 

775 

Val Asp Val Ser 
790 



Glu Met Ser Gly 
715 

Leu Ser Trp Arg 
730 

Ala Pro Ser Met 
745 

Phe lie Asn Thr 



Arg Gin Leu Ala 
780 

Asp Lys Ala Gly 
795 



Arg Ser lie Leu 
720 

Leu Gly Gin Ser 
735 

Gly Asn Gly Tyr 
750 

Glu Ser Gly Arg 
765 

Ser Arg Gly Ser 



Asp Glu Met Asp 
800 
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Leu Glu Gly He Ser Met Asp Ala Pro Gly Tyr Met Ser Asp Gly Asp 
805 810 815 

Val Leu Ser Lys Asn He Arg Thr Asp Asp He Thr Ser Gly Tyr Met 
820 825 830 

Thr Asp Gly Gly Leu Gly Leu Tyr Thr Arg Arg Leu Asn Arg Leu Pro 
835 840 845 

Asp Gly Met Ala Val Val Arg Glu Thr Leu Gin Arg Asn Thr Ser Leu 
850 855 860 

Gly Leu Gly Asp Ala Asp Ser Trp Asp Asp Ser Ser Ser Val Ser Ser 
865 870 875 880 

Gly He Ser Asp Thr He Asp Asn Leu Ser Thr Asp Asp He Asn Thr 
885 890 895 

Ser Ser Ser He Ser Ser Tyr Ala Asn Thr Pro Ala Ser Ser Arg Lys 
900 905 910 

Asn Leu Asp Val Gin Thr Asp Ala Glu Lys His Ser Gin Val Glu Arg 
915 920 925 

Asn Ser Leu Trp Ser Gly Asp Asp Val Lys Lys Ser Asp Gly Gly Ser 
930 935 940 

Asp Ser Gly He Lys Met Glu Pro Gly Ser Lys Trp Arg Arg Asn Pro 
945 950 955 960 

Ser Asp Val Ser Asp Glu Ser Asp Lys Ser Thr Ser Gly Lys Lys Asn 
965 970 975 

Pro Val He Ser Gin Thr Gly Ser Trp Arg Arg Gly Met Thr Ala Gin 
980 985 990 

Val Gly He Thr Met Pro Arg Thr Lys Ala Ser Ala Pro Ala Gly Ala 
995 1000 1005 

Leu Lys Thr Pro Gly Thr Gly Lys Thr Asp Asp Ala Lys Val Ser Glu 
1010 1015 1020 

Lys Gly Arg Leu Ser Pro Lys Ala Ser Gin Val Lys Arg Ser Pro Ser 
1025 1030 1035 1040 

Asp Ala Gly Arg Ser Ser Gly Asp Glu Ser Lys Lys Pro Leu Pro Ser 
1045 1050 1055 



Ser Ser Arg Thr Pro Thr Ala Asn Ala Asn Ser Phe Gly Phe Lys Lys 
1060 1065 1070 

Gin Ser Gly Ser Ala Thr Gly Leu Ala Met He Thr Ala Ser Gly Val 
1075 1080 1085 

Thr Val Thr Ser Arg Ser Ala Thr Leu Gly Lys He Pro Lys Ser Ser 
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1090 1095 1100 

Ala Leu Val Ser Arg Ser Ala Gly Arg Lys Ser Ser Met Asp Gly Ala 
1105 1110 1115 1120 

Gin Asn Gin Asp Asp Gly Tyr Leu Ala Leu Ser Ser Arg Thr Asn Leu 
1125 1130 1135 

Gin Tyr Arg Ser Leu Pro Arg Pro Ser Lys Ser Asn Ser Arg Asn Gly 
1140 1145 1150 

Ala Gly Asn Arg Ser Ser Thr Ser Ser lie Asp Ser Asn lie Ser Ser 
1155 1160 1165 

Lys Ser Ala Gly Leu Pro Val Pro Lys Leu Arg Glu Pro Ser Lys Thr 
1170 1175 1180 

Ala Leu Gly Ser Ser Leu Pro Gly Leu Val Asn Gin Thr Asp Lys Glu 
1185 1190 1195 1200 

Lys Gly lie Ser Ser Asp Asn Glu Ser Val Ala Ser Cys Asn Ser Val 
1205 1210 1215 

Lys Val Asn Pro Ala Ala Gin Pro Val Ser Ser Pro Ala Gin Thr Ser 
1220 1225 1230 

Leu Gin Pro Gly Ala Lys Tyr Pro Asp Val Ala Ser Pro Thr Leu Arg 
1235 1240 1245 

Arg Leu Phe Gly Gly Lys Pro Thr Lys Gin Val Pro lie Ala Thr Ala 
1250 1255 1260 

Glu Asn Met Lys Asn Ser Val Val lie Ser Asn Pro His Ala Thr Met 
1265 1270 1275 1280 

Thr Gin Gin Gly Asn Leu Asp Ser Pro Ser Gly Ser Gly Val Leu Ser 
1285 1290 1295 

Ser Gly Ser Ser Ser Pro Leu Tyr Ser Lys Asn Val Asp Leu Asn Gin 
1300 1305 1310 

Ser Pro Leu Ala Ser Ser Pro Ser Ser Ala His Ser Ala Pro Ser Asn 
1315 1320 1325 

Ser Leu Thr Trp Gly Thr Asn Ala Ser Ser Ser Ser Ala Val Ser Lys 
1330 1335 1340 

Asp Gly Leu Gly Phe Gin Ser Val Ser Ser Leu His Thr Ser Cys Glu 
1345 1350 1355 1360 

Ser lie Asp lie Ser Leu Ser Ser Gly Gly Val Pro Ser His Asn Ser 
1365 1370 1375 

Ser Thr Gly Leu lie Ala Ser Ser Lys Asp Asp Ser Leu Thr Pro Phe 
1380 1385 1390 

Val Arg Thr Asn Ser Val Lys Thr Thr Leu Ser Glu Ser Pro Leu Ser 
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1395 1400 1405 



Ser Pro Ala Ala Ser Pro Lys Phe Cys Arg Ser Thr Leu Pro Arg Lys 
1410 1415 1420 

Gin Asp Ser Asp Pro His Leu Asp Arg Asn Thr Leu Pro Lys Lys Gly 
1425 1430 1435 1440 

Leu Arg Tyr Thr Pro Thr Ser Gin Leu Arg Thr Gin Glu Asp Ala Lys 
1445 1450 1455 

Glu Trp Leu Arg Ser His Ser Ala Gly Gly Leu Gin Asp Thr Ala Ala 
1460 1465 1470 

Asn Ser Pro Phe Ser Ser Gly Ser Ser Val Thr Ser Pro Ser Gly Thr 
1475 1480 1485 

Arg Phe Asn Phe Ser Gin Leu Ala Ser Pro Thr Thr Val Thr Gin Met 
1490 1495 1500 

Ser Leu Ser Asn Pro Thr Met Leu Arg Thr His Ser Leu Ser Asn Ala 
1505 1510 1515 1520 

Asp Gly Gin Tyr Asp Pro Tyr Thr Asp Ser Arg Phe Arg Asn Ser Ser 
1525 1530 1535 

Met Ser Leu Asp Glu Lys Ser Arg Thr Met Ser Arg Ser Gly Ser Phe 
1540 1545 1550 

Arg Asp Gly Phe Glu Glu Val His Gly Ser Ser Leu Ser Leu Val Ser 
1555 1560 1565 

Ser Thr Ser Ser Val Tyr Ser Thr Pro Glu Glu Lys Cys Gin Ser Glu 
1570 1575 1580 

lie Arg Lys Leu Arg Arg Glu Leu Asp Ala Ser Gin Glu Lys Val Ser 
1585 1590 1595 1600 

Ala Leu Thr Thr Gin Leu Thr Ala Asn Ala His Leu Val Ala Ala Phe 
1605 1610 1615 

Glu Gin Ser Leu Gly Asn Met Thr lie Arg Leu Gin Ser Leu Thr Met 
1620 1625 1630 

Thr Ala Glu Gin Lys Asp Ser Glu Leu Asn Glu Leu Arg Lys Thr lie 
1635 1640 1645 

Glu Leu Leu Lys Lys Gin Asn Ala Ala Ala Gin Ala Ala lie Asn Gly 
1650 1655 1660 

Val lie Asn Thr Pro Glu Leu Asn Cys Lys Gly Asn Gly Thr Ala Gin 
1665 1670 1675 1680 

Ser Ala Asp Leu Arg lie Arg Arg Gin His Ser Ser Asp Ser Val Ser 
1685 1690 1695 
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Ser lie Asn Ser Ala Thr Ser His Ser Ser Val Gly Ser Asn lie Glu 
1700 1705 1710 

Ser Asp Ser Lys Lys Lys Lys Arg Lys Asn Trp Val Asn Glu Leu Arg 
1715 1720 1725 

Ser Ser Phe Lys Gin Ala Phe Gly Lys Lys Lys Ser Pro Lys Ser Ala 
1730 1735 1740 

Ser Ser His Ser Asp lie Glu Glu Met Thr Asp Ser Ser Leu Pro Ser 
1745 1750 1755 1760 



Ser Pro Lys Leu Pro His Asn Gly Ser Thr Gly Ser Thr Pro Leu Leu 
1765 1770 1775 

Arg Asn Ser His Ser Asn Ser Leu lie Ser Glu Cys Met Asp Ser Glu 
1780 1785 1790 

Ala Glu Thr Val Met Gin Leu Arg Asn Glu Leu Arg Asp Lys Glu Met 
1795 1800 1805 

Lys Leu Thr Asp lie Arg Leu Glu Ala Leu Ser Ser Ala His Gin Leu 
1810 1815 1820 

Asp Gin Leu Arg Glu Ala Met Asn Arg Met Gin Ser Glu lie Glu Lys 
1825 1830 1835 1840 

Leu Lys Ala Glu Asn Asp Arg Leu Lys Ser Glu Ser Gin Gly Ser Gly 
1845 1850 1855 

Cys Ser Arg Ala Pro Ser Gin Val Ser lie Ser Ala Ser Pro Arg Gin 
1860 1865 1870 

Ser Met Gly Leu Ser Gin His Ser Leu Asn Leu Thr Glu Ser Thr Ser 
1875 1880 1885 

Leu Asp Met Leu Leu Asp Asp Thr Gly Glu Cys Ser Ala Arg Lys Glu 
1890 1895 1900 

Gly Gly Arg His Val Lys lie Val Val Ser Phe Gin Glu Glu Met Lys 
1905 1910 1915 1920 

Trp Lys Glu Asp Ser Arg Pro His Leu Phe Leu lie Gly Cys lie Gly 
1925 1930 1935 

Val Ser Gly Lys Thr Lys Trp Asp Val Leu Asp Gly Val Val Arg Arg 
1940 1945 1950 

Leu Phe Lys Glu Tyr lie lie His Val Asp Pro Val Ser Gin Leu Gly 
1955 1960 1965 

Leu Asn Ser Asp Ser Val Leu Gly Tyr Ser lie Gly Glu lie Lys Arg 
1970 1975 1980 

Ser Asn Thr Ser Glu Thr Pro Glu Leu Leu Pro Cys Gly Tyr Leu Val 
1985 1990 1995 2000 
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Gly Glu Asn Thr Thr lie Ser Val Thr Val Lys Gly Leu Ala Glu Asn 
2005 2010 2015 

Ser Leu Asp Ser Leu Val Phe Glu Ser Leu lie Pro Lys Pro lie Leu 

2020 2025 2030 

Gin Arg Tyr Val Ser Leu Leu lie Glu His Arg Arg lie lie Leu Ser 
2035 2040 2045 

Gly Pro Ser Gly Thr Gly Lys Thr Tyr Leu Ala Asn Arg Leu Ser Glu 
2050 2055 2060 

Tyr lie Val Leu Arg Glu Gly Arg Glu Leu Thr Asp Gly Val lie Ala 
2065 2070 2075 2080 

Thr Phe Asn Val Asp His Lys Ser Ser Lys Glu Leu Arg Gin Tyr Leu 
2085 2090 2095 

Ser Asn Leu Ala Asp Gin Cys Asn Ser Glu Asn Asn Ala Val Asp Met 

210,0 2105 2110 



Pro Leu Val lie lie Leu Asp Asn Leu His His Val Ser Ser Leu Gly 

2115 2120 2125 

Glu lie Phe Asn Gly Leu Leu Asn Cys Lys Tyr His Lys Cys Pro Tyr 

2130 2135 2140 

lie lie Gly Thr Met Asn Gin Ala Thr Ser Ser Thr Pro Asn Leu Gin 

2145 2150 2155 2160 

Leu His His Asn Phe Arg Trp Val Leu Cys Ala Asn His Thr Glu Pro 

2165 2170 2175 

Val Lys Gly Phe Leu Gly Arg Phe Leu Arg Arg Lys Leu Met Glu Thr 

2180 2185 2190 

Glu lie Ser Gly Arg Val Arg Asn Met Glu Leu Val Lys lie lie Asp 

2195 2200 2205 

Trp lie Pro Lys Val Trp His His Leu Asn Arg Phe Leu Glu Ala His 

2210 2215 2220 

Ser Ser Ser Asp Val Thr lie Gly Pro Arg Leu Phe Leu Ser Cys Pro 

2225 2230 2235 2240 

lie Asp Val Asp Gly Ser Arg Val Trp Phe Thr Asp Leu Trp Asn Tyr 

2245 2250 2255 

Ser lie lie Pro Tyr Leu Leu Glu Ala Val Arg Glu Gly Leu Gin Leu 

2260 2265 2270 

Tyr Gly Arg Arg Ala Pro Trp Glu Asp Pro Ala Lys Trp Val Met Asp 

2275 2280 2285 

Thr Tyr Pro Trp Ala Ala Ser Pro Gin Gin His Glu Trp Pro Pro Leu 
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2290 



2295 



2300 



Leu Gin 
2305 



Leu Arg Pro Glu Asp Val Gly Phe Asp Gly Tyr Ser Met Pro 
2310 2315 2320 



Arg Glu 



Gly Ser Thr Ser Lys Gin Met Pro Pro Ser Asp Ala Glu Gly 
2325 2330 2335 



Asp Pro 



Leu Met Asn Met Leu Met Arg Leu Gin Glu Ala Ala Asn Tyr 
2340 2345 2350 



Ser Ser 



Pro Gin Ser Tyr Asp Ser Asp Ser Asn Ser Asn Ser His His 
2355 2360 2365 



Asp Asp 
2370 



lie Leu Asp Ser Ser Leu Glu Ser Thr Leu 
2375 2380 



<210> 28 

<211> 96 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> (1) . . (93) 

<400> 28 

age agg gag agg gga ggg agt gtg ccg tct ctt ctg caa ggg cag tgc 4 8 

Ser Arg Glu Arg Gly Gly Ser Val Pro Ser Leu Leu Gin Gly Gin Cys 

ccc age etc age cac act tct gat ctg cag tec aac aga cct ttc tag 96 
Pro Ser Leu Ser His Thr Ser Asp Leu Gin Ser Asn Arg Pro Phe * 



<210> 29 
<211> 75 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> (1) . . (72) 

<400> 29 

cat gee aaa gag aac ctg ggg gtg cca ggg ggt cct cag age tea cac 48 
His Ala Lys Glu Asn Leu Gly Val Pro Gly Gly Pro Gin Ser Ser His 



tgc act tgt ggc ace cac age gag tag 75 
Cys Thr Cys Gly Thr His Ser Glu * 
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<210> 30 

<211> 489 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (489) 

<400> 30 

cca tec gtg age cga gga aac tgt aca cag ate tac aca gac tgg gee 48 
Pro Ser Val Ser Arg Gly Asn Cys Thr Gin lie Tyr Thr Asp Trp Ala 

aat cat tac eta gec aaa tec ggc cac aag cgt etc ate aag gat etc 96 
Asn His Tyr Leu Ala Lys Ser Gly His Lys Arg Leu lie Lys Asp Leu 

cag caa gat gtg aca gat ggc gtc etc ctg gec cag att ate cag gtt 144 
Gin Gin Asp Val Thr Asp Gly Val Leu Leu Ala Gin lie lie Gin Val 

gtg gca aat gaa aag att gaa gac ate aat ggc tgt ccg aag aac aga 192 
Val Ala Asn Glu Lys lie Glu Asp lie Asn Gly Cys Pro Lys Asn Arg 

tec caa atg att gaa aac ata gat gee tgc ttg aat ttc ctg gca get 240 
Ser Gin Met lie Glu Asn lie Asp Ala Cys Leu Asn Phe Leu Ala Ala 

aag gga ata aac ate cag ggg ctg tct gca gaa gag ate agg aat gga 288 
Lys Gly lie Asn lie Gin Gly Leu Ser Ala Glu Glu lie Arg Asn Gly 

aac etc aag gee att eta ggc etc ttc ttc age etc tec cga tac aag 336 
Asn Leu Lys Ala lie Leu Gly Leu Phe Phe Ser Leu Ser Arg Tyr Lys 



cag cag cag cag cag ccc cag aag cag cac etc tec tea cct ctg ccg 3 84 

Gin Gin Gin Gin Gin Pro Gin Lys Gin His Leu Ser Ser Pro Leu Pro 

ccc gec gta tec cag gtg gec ggg gec ccc tec cag tgc cag get ggc 432 
Pro Ala Val Ser Gin Val Ala Gly Ala Pro Ser Gin Cys Gin Ala Gly 

acc cct cag cag cag gtg cca gtc act ccc caa gec ccg tgc cag cct 480 
Thr Pro Gin Gin Gin Val Pro Val Thr Pro Gin Ala Pro Cys Gin Pro 



cac cag cca 
His Gin Pro 



489 
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<210> 31 
<211> 31 
<212> PRT 

<213> Homo sapiens 
<400> 31 



Ser Arg Glu Arg Gly Gly Ser Val Pro Ser Leu Leu Gin Gly Gin Cys 
15 10 15 

Pro Ser Leu Ser His Thr Ser Asp Leu Gin Ser Asn Arg Pro Phe 
20 25 30 



<210> 32 
<211> 24 
<212> PRT 

<213> Homo sapiens 
<400> 32 



His Ala Lys Glu Asn Leu Gly Val Pro Gly Gly Pro Gin Ser Ser His 
5 10 15 

Cys Thr Cys Gly Thr His Ser Glu 
20 



<210> 33 
<211> 163 
<212> PRT 

<213> Homo sapiens 
<400> 33 



Pro Ser Val Ser Arg Gly Asn Cys Thr Gin lie Tyr Thr Asp Trp Ala 

5 10 15 

Asn His Tyr Leu Ala Lys Ser Gly His Lys Arg Leu lie Lys Asp Leu 
20 25 30 

Gin Gin Asp Val Thr Asp Gly Val Leu Leu Ala Gin lie lie Gin Val 

35 40 45 

Val Ala Asn Glu Lys lie Glu Asp lie Asn Gly Cys Pro Lys Asn Arg 
50 55 60 

Ser Gin Met lie Glu Asn lie Asp Ala Cys Leu Asn Phe Leu Ala Ala 
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65 



70 



75 



80 



Lys Gly lie Asn lie Gin Gly 
85 



Leu Ser Ala Glu Glu lie Arg Asn Gly 
90 95 



Asn Leu Lys Ala lie Leu Gly 
100 



Leu Phe Phe Ser Leu Ser Arg Tyr Lys 
105 110 



Gin Gin Gin Gin Gin Pro Gin 
115 



Lys Gin His Leu Ser Ser Pro Leu Pro 
120 125 



Pro Ala Val Ser Gin Val Ala 
130 135 



Gly Ala Pro Ser Gin Cys Gin Ala Gly 
140 



Thr Pro Gin Gin Gin Val Pro 
145 150 



Val Thr Pro Gin Ala Pro Cys Gin Pro 
155 160 



His Gin Pro 



<210> 34 
<211> 479 
<212> DNA 
<213> mouse 

<220> 

<221> CDS 

<222> (3) . . . (476) 

<400> 34 

at gag aag age cga aca atg agt egg tea ggc tec ttc egg gat ggg 
Glu Lys Ser Arg Thr Met Ser Arg Ser Gly Ser Phe Arg Asp Gly 



ttt gag gaa gtt cat gga tec tec ctg tec ttg gtt tec age aca tec 95 
Phe Glu Glu Val His Gly Ser Ser Leu Ser Leu Val Ser Ser Thr Ser 



tec ate tac tec acg cca gaa gaa aaa tgc cag tea gag att cga aag 143 
Ser lie Tyr Ser Thr Pro Glu Glu Lys Cys Gin Ser Glu lie Arg Lys 



ctg agg cga gaa ctg gat gee tec cag gaa aag gtg tct gcg ctg act 191 
Leu Arg Arg Glu Leu Asp Ala Ser Gin Glu Lys Val Ser Ala Leu Thr 



ace cag ctg act gca aat get cac ctt gtg gca gee ttc gag cag agt 
Thr Gin Leu Thr Ala Asn Ala His Leu Val Ala Ala Phe Glu Gin Ser 



ctg gga aac atg ace ate agg eta cag agt tta act atg ace get gag 
Leu Gly Asn Met Thr lie Arg Leu Gin Ser Leu Thr Met Thr Ala Glu 



cag aag gat tea gaa ctg aac gag tta aga aaa acc ate gag ctg ctg 
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Gin Lys Asp Ser Glu Leu Asn Glu Leu Arg Lys Thr lie Glu Leu Leu 



aag aaa cag aat gca get gec cag get gec att aat gga gtg att aac 
Lys Lys Gin Asn Ala Ala Ala Gin Ala Ala lie Asn Gly Val lie Asn 



cgc cac cag cca etc aag tgt g 
Arg His Gin Pro Leu Lys Cys 



<210> 36 
<211> 158 
<212> PRT 
<213> mouse 



<400> 36 



Glu Lys Ser Arg Thr Met Ser Arg Ser Gly Ser Phe Arg Asp Gly Phe 

15 10 15 

Glu Glu Val His Gly Ser Ser Leu Ser Leu Val Ser Ser Thr Ser Ser 

20 25 30 

lie Tyr Ser Thr Pro Glu Glu Lys Cys Gin Ser Glu lie Arg Lys Leu 

35 40 45 

Arg Arg Glu Leu Asp Ala Ser Gin Glu Lys Val Ser Ala Leu Thr Thr 

50 55 60 

Gin Leu Thr Ala Asn Ala His Leu Val Ala Ala Phe Glu Gin Ser Leu 

65 70 75 80 

Gly Asn Met Thr lie Arg Leu Gin Ser Leu Thr Met Thr Ala Glu Gin 

85 90 95 



383 



acg cca gag etc aac tgc aaa gga aat ggc agt gee agg eta cag acc 431 
Thr Pro Glu Leu Asn Cys Lys Gly Asn Gly Ser Ala Arg Leu Gin Thr 

tac gca tec gca gca aca etc etc cga cag tgt etc cag tat caa tag 479 
Tyr Ala Ser Ala Ala Thr Leu Leu Arg Gin Cys Leu Gin Tyr Gin * 



<210> 35 
<211> 22 
<212> DNA 
<213> mouse 

<220> 

<221> CDS 

<222> (1) . . . (21) 

<400> 35 



22 
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Lys Asp Ser Glu 
100 

Lys Gin Asn Ala 
115 

Pro Glu Leu Asn 
130 

Ala Ser Ala Ala 
145 



Leu Asn Glu Leu 



Ala Ala Gin Ala 
120 

Cys Lys Gly Asn 
135 

Thr Leu Leu Arg 
150 



Arg Lys Thr lie 
105 

Ala lie Asn Gly 



Gly Ser Ala Arg 
140 

Gin Cys Leu Gin 
155 



Glu Leu Leu Lys 
110 

Val lie Asn Thr 
125 

Leu Gin Thr Tyr 



Tyr Gin 



<210> 37 

<211> 7 

<212> PRT 

<213> mouse 



<400> 37 



Arg His Gin Pro Leu Lys Cys 
5 



<210> 38 
<211> 19 
<212> PRT 

<213> artificial Sequence 
<220> 

<223> Description of the artificial Sequence: Peptide for production of AK 
<400> 38 

Glu Lys Gly Glu Asp Pro Glu Thr Arg Arg Met Arg Thr Val Lys Asn 
5 10 15 

lie Ala Asp 



<210> 39 

<211> 81369 

<213> DNA 

<214> Homo sapiens 

<400> 39 



gatcagactt tgaagagtgt ttgtaccatg ctaaagttta cagaatttat tcctgctctt 
tgagggtgca ttgcaaatcc aggctagagg gagagatacc agttaggana gtacagcaat 
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actctactgg 


gaaatggtga 


ggtgtttcgt 


gaagacaatg 


gcaacacaga 


tgaagacatg 


180 


cagatggagg 


aaataaagat 


ccagttgagc 


ttgttggcca 


gttggataga 


ggttgaggtt 


240 


atgcatgatg 


gagcaatcta 


ggtttttgtc 


ttgggtaggt 


gtttccatga 


tagtactcag 


300 


aatgaatcat 


atagttgtac 


aggttgaatc 


ccacccatgt 


ttgcacaata 


gagtgactgt 


360 


ctagctgaaa 


tccagatgac 


actctgtatg 


ctaagctatg 


cttcatggaa 


ctgtataaag 


420 


gcacttgcta 


cataggctag 


tggcagatct 


ggaagtaacc 


tatatggtat 


ataggaaatg 


480 


aggtggcttt 


tgtataaatc 


ctacagataa 


atttcatttc 


ctgatcctat 


tattttgact 


540 


catgttagcc 


caagaagagt 


attcagtact 


tcatatccct 


gaaggtaaga 


cagagtagta 


600 


ttagattcac 


tatttggcaa 


ataaaaggga 


tcaagtccta 


agatcaagct 


gatgaatcaa 


660 


cacctcatag 


gatatgtccc 


aaccaattat 


atggcttccc 


ctataaataa 


aatctagttc 


720 


tcttctctgg 


agaggaacag 


tgaagaatat 


cataacctat 


gctacaaact 


gcttgagtag 


780 


gagctacttc 


tctccaaggc 


tttatatcat 


tcattctggc 


aggcccctct 


gtttgttctc 


840 


accagctcct 


gggaaattta 


tttctcctct 


agtgatataa 


aagctctctg 


tttgagatga 


900 


agggctgccc 


agtttatcag 


atctgtatta 


gtctgttctc 


aggctgctaa 


taaagacata 


960 


cctgagactg 


agtaatttat 


gaaggaaaga 


ggtttaattg 


actcacagtt 


ccacatggct 


1020 


ggggaggcct 


cacaatcatg 


gcgaaagact 


aataaggagc 


aaagtcacat 


cttacatggc 


1080 


tgcagacaag 


agagcatgtg 


caggggaact 


gctctccata 


aaaccatcag 


atcttgtgag 


1140 


acttgttcac 


tattacaaga 


acaacagaca 


ggaaaacccg 


ccccctcaat 


tcaattacct 


1200 


gccactggga 


ccctcccaca 


acacatgggg 


attatgagag 


ctacaattca 


agatgagatt 


1260 


tgggtgggga 


taccgccaaa 


ccatatgaag 


ttctttcttt 


gttactgggt 


accatatcca 


1320 


ttctgttgag 


gttctgagcc 


tttccagtta 


ctgtaactcc 


tctatctcct 


gtctgtgcta 


1380 


agactcagtg 


acctctctct 


gccttgcttc 


tgctttgtcc 


tgaccctttc 


tgtgcatgca 


1440 


ctcactctag 


tttgcccacc 


tgaggtgaga 


gatggtccag 


attagcaaca 


acaatctgtg 


1500 


gactaaaatc 


ctctttaggg 


aggaagcaaa 


attcagatgg 


atgttactaa 


acaaagctca 


1560 


gaaacagaga 


ccagggtgtg 


ggaagtaagg 


tagtagcctg 


agagcagctg 


gcagtgtttt 


1620 


agacctggag 


ggaggttagg 


tcatcagcaa 


tgaggagact 


gcctggaaaa 


tcctagaaaa 


1680 


ttaagacatc 


tggtcaggca 


aggtcatatc 


accagcacac 


ttcccttttc 


aagttgaatc 


1740 


cctttcctct 


gttaagagga 


ttcaagtgtc 


tttcttgcat 


tttgtcttct 


cttctatatc 


1800 
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catgcttgca atataaggag acagcagttg gctgtttgtg ctagaaaata taaatggcca 1860 

ttttgaaagc atgccagaca ggatctgcgg caagttttca atgttactgc tgccatctgt 192 0 

tgttcttcag tgctgggatg tgaatctctt ggcaaacatc tctctaattc tgaactatct 1980 

ttcaccccca tctagagata ttcacttact gaagtgcctt tttaaagcaa tgttcctcac 2040 

caaggcgatg ttctgaatgt tttaaaatgg aagaatctgg aatgttttta ttataataca 2100 

ttttgtatat cccaaagcaa aaatcaattt cttcatggtt aatacttttg taattttgtt 2160 

tttaataata ttttcctttt aaatataaga aatattttat tgaattaata ctttaatgta 2220 

gctgtttcaa gtaagataaa acagaacaga ttactgtttt caaccttgtt cacagttagc 2280 

tctgtaacta agttgttgag ctttatctaa gcttttttat ttttacataa cgtttccctt 2340 

ttcacttaac cttgaaatta tagtaatttg ggaacttcta ttcctctgaa agagaaagct 2400 

aatgccaaag atatttcaag ggagaaagaa ggtttttaaa aggagagaca attcagctca 2460 

gacttaatag ctgtgattgc tatttattaa gcagaacgcc tataactaaa ttctcagata 2520 

tccaaaaaac agcctgtaca ttctcaaaag tgaagattac acattttcta agttaaggta 2580 

aaagttttgt ctctgtagca tcttactgat ttctatcttc tcattctgcc ttaataatgt 2640 

cactaaataa atgtttgatg cactaataca tgaataaaac tattcatggt aatgattctt 2700 

tagaaacaca gctaagtttt gtaattttgt tttttaaaaa ttaaaaattt aaatataaaa 2760 

atgtttttaa aaggcttgaa tttcttgtta aatgtacaca ttttaagttg taggctgtct 2820 

ttaaaaataa tctctccaca cactgtagta tttaaaacat catgatatta ctataaaaca 2880 

tcaacaaata gggcagtgga aaacatggta atcactaaaa atgctcacat gtcatatatt 2940 

aagacttgat aagtaaacca caataataaa tagaaaagaa atagttgtct aaaaagggat 3000 

tctcaccttt caaaccttac cataaaaatg gaatataaaa gaaggaagag gaggagaaat 3 060 

caaattatat cataaaattt tctgggcaaa aatattacag aagaaaataa gaaagattta 312 0 

tggagttgac tgaaacattt ttgaatccta tacataaaaa tatcgttaat taaaaggaaa 3180 

aacaaagaaa cagatttggg aaatatttga aactggtttt tttttagcat ttaaaaatgt 3240 

aatacaaatg gattatttaa actccattgc aaaaatacac aaaggacatt gacaatgtct 3300 

ggaaataaaa ttagctaagt aagttataga aaaactcagt ctcacaattt gacaaatgta 3360 

actgaaaact attaatataa ttagtaacta tttttacatg tcaaaatttt tgaattacta 3420 

aaggaaacca caatgcctga aagtatccag ggtttttttt tttttttata atattggcac 3480 

tgtcatatgg gtggcaggaa ttgaagtgat gttgtttctt cagttattaa gttgcatctg 3540 
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cagtgtttca aatgtccaaa acctgtgagt cagtaattct ctttttgtat atttatccta 3600 

atacaataat tctaaacata atctcaatat atatgtacaa agttattcac tgcagtgtta 3660 

cttacaatag ttagaaaatt gtaaaatgct ttatgcatct taaaatataa attgttgaat 3720 

atataatagt ccatatgata taattatatc attattataa ataatgaatt agaaaataat 3780 

ttaagagcat taaaataatt ataaggtaat atgaagtgaa tgaataatgt acagatacta 3 840 

taatcagcag agtgttaact aggtaaattt ttatgtgtgt atatactact tcctaaaaat 3 900 

gacttgacag aaatcatcaa aatgctaatg gtggttactt ctgggtggga atacagatga 3 960 

tttactttgt tccttttatg tatttctgca ctgcccagtc ttccacagtg agcatatatt 4020 

ggtttttaaa tttatataag atggaaaaag ataccaaatg gtcttcaatg aatcctggag 4080 

ttaactttca tgtgtgtcat atgttatatt ctaaacttat cacaaataga agactttaaa 4140 

tcaacttgta cctatttcaa ctatataaca gcatctttaa aatgagcatt gaattaaact 4200 

accaaaacca accatcatga ggattattca agtaatgtgt ttaaacaaaa gaatttgtaa 4260 

taaaattact ttatctcctt tgtgatttca gcccatttaa aaaaaataga tgtttctact 4320 

ctccttcaga tatcattaaa acataaactt gtgcctgact gcataaatcc cttttaaact 4380 

aatatcactt attacgttta actaagtcta cctagggctt ccttgtataa agaacaagag 4440 

ctttccattt tttgtttacc tagccctttc tgatgccacg acagaatagc tgtaaatctt 4500 

cattatttat attctagaga aaataaaagc aaataaaaag gtcagtgtat aaagtttatt 4560 

ggttgttctc tttactcaaa acccacatgg tattaatgtt agtctctatg aatatttcat 4620 

ggataaaatc agagcattaa gtgcatacta aaaacaataa gaatggaaag actttaacct 4680 

tatgtttata tgaatttcta ggttatcaag aagtttatag gctataggct ataaagtctt 4740 

aggctatgat atagtaacct aatgtagact tcccttgata catgaaaata atggtactaa 4800 

gtacaaacag aagatgagct taaaattatt ctttgagtcc tcttgatgga ttttttcccc 4860 

cacactttcc ccaaaattgt tttatgccta tattgtagga gaccatgcaa gagacctaga 4920 

gtctcttttt ctttcatcac tttccaatca acagcaaatc ctatcatttt taccacaaaa 4980 

tatatcttga aactcccttc ttttgattta cttgtaactc cccatcaaaa actgaagagt 5040 

gtcacaatac ttcattaagt tccctacttg cactctacct ttaatatatt tgtagcacta 5100 

aaatgttttt aaaacatata tctgcttatg tcattttact gctcaatact atctgatttt 5160 

ctattgcact tctaagatac tctaatttct tagcactcta tataaaatcc tttaagggct 5220 
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\~ccci~cicf'c^ 


cc 1 1 1 1~ caaa 


c t cagaac ta 


tgtatttcct 


t ttgectget 


gtacttgtac 


5280 


cactggattc 


ttgatttttg 


t tacttccag 


gtttttacac 


ttatttttac 


aataaatgtg 


5340 


aaataccctt 


tttgacaata 


tctacaaata 


tttcttattt 


gtctttattg 


ctctttcctg 


5400 


taatgtttag 


tcttcatttt 


cctqataatg 

ZJ — ' 


gctatctaaa 


gttatctcct 


caaagaagca 


5460 


gttatttatt 


cacccaaatc 


ttctagtcct 


tctctggagt 


tttcttctca 


cttcattccc 


5520 


ttggtttttg 


ccacaat ttg 


taataatttg 


caatttggag 


tgttagaatg 


agggaataaa 


5580 


tcacaggt aa 


tgac tatagt 


ttgtgactat 


gtaagattgg 


attegttatt 


gatttattcc 


5640 


acaaacactg 


aggcactgca 


tttagccaaa 


tgecaatett 


gggcagtgag 


actctgaaag 


5700 


agaatc tgc t 


tcccccacca 


taaactacaa 


agtgaaacaa 


ctcagaatgt 


acataaatta 


5760 


cagaatgaaa 


gcacac taga 


agtaaacaca 


qatqtqqaaq 

ZJ ^ ^ ZJ ZJ ZJ Zj 


aggtaaagtg 


tccttgaaaa 


5820 


t ca t ggaaag 


at t cataaag 


ggaatgacat 


t tcaactgga 


ttctaaacca 


gttattcaag 


5880 


c t ccacaagg 


t tgc a c agt a 


aatgagcagt 


qqcaqqatqa 

ZJ ZJ ZJ ZJ ZJ 


cataccttag 


aaagtaaaag 


5940 


gaatcttt t t 


taaactgcta 


taaaaatcat 


tacatataca 


ttttgtaggt 


cgagagtaag 


6000 


gtatttaaca 


taaaatcatt 


ttagtatatc 


agtgtttata 


tagacttagg 


tttttctcat 


6060 


ttaaaacctc 


ttt taatgac 


ttgtgctttt 


cttcatggta 


ataaaacatt 


ttcccaggaa 


6120 


gtgctgaata 


aatctttct t 


gaaatacgtt 


ttattgcttt 


ctatcaatga 


ccctgaagta 


6180 


atacagaat t 


tacac t t cag 


cQQttacaat 

^— ^ zj zj — zj 


gctcaaactt 


gacaggtaat 


gcactgtgtt 


6240 


tgc t gat at a 


agaggt atga 


tgtagggcta 


aqtqqttttq 

3 } Zj ZJ Zj 


tgctcattta 


gctttcagga 


6300 


gaaaataat t 


gacttaacat 


tttgatacta 


aaacccaaag 


cctaacagtt 


aattcttggt 


6360 


attttaaatt 


at tattgcaa 


agattattgt 


gecgaataat 


atgaaaatat 


tttatataat 


6420 


att taaaaag 


tatatct ctt 


tcttggtatt 


atttaaatta 


ccataaaaat 


gtgcgaaaaa 


6480 


gt t atactga 


aatgtgatag 


gatcttttaa 


aagtggtgcc 


ttgattttgt 


taagtgttac 


6540 


c t agt 1 1 1 cc 


t c tgaaaaca 


agaaacat ac 


ccagaagt tt 


tcacgaaatg 


gtctcatgaa 


6600 


tat c taaggt 


t agt c eg tag 


tc teat ctga 


gacaaggaaa 


gtcccttcca 


etatgagect 


6660 


gtaaaatcac 


aagcaagcta 


gttacttcct 


agatacaatg 


ggagtactgg 


tattgggtaa 


6720 


acacagctgt 


ttcaaatggg 


agaaattggc 


caaaattaat 


gggttacagg 


geatgeaatt 


6780 


ccgaaatcca 


tctgggcagt 


caaattgtaa 


aactccaaaa 


tgatntcttt 


tgactccatg 


6840 


tntcacatcc 


aggacatget 


gangcaagag 


ataggttccc 


ataatctttg 


gcagctctgc 


6900 


ccctgtggct 


ttgcagggta 


tatcacccct 


cccagctgct 


ttcacaggct 


ggcattgagt 


6960 
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gtctgtggct 


ttcccaggaa 


caaggtgcaa 


gctgttggtg 


gatctaccat 


tctggggttt 


7020 


ggaggatgat 


ggccctcttc 


tcatagctcc 


actaggccgt 


gctccagtag 


agactctgtg 


7080 


ggggctctga 


ccccagattt 


ccctcctgca 


ctgccctagc 


agagattctt 


catgagggcc 


7140 


gtgcccctgc 


agaaaactct 


ttcctgggca 


tccaggcatt 


tccatacatc 


tgaaatctag 


7200 


gtggaggttc 


ccaaacctcg 


attcttaatt 


tctgtgcacc 


tgcaggctct 


ctaccacgtg 


7260 


gaagctgcca 


aggtttgggg 


cttgcaccct 


ctgaaaccac 


aggctgagct 


ataccttggc 


7320 


cccttttagc 


aatggctgga 


gtgactggga 


cacagggcac 


caagtctcta 


ggctgcacac 


7380 


agtatgggca 


ccctgggccc 


agccctcaaa 


atcatttttt 


cctcctaggc 


ttctggatca 


7440 


gtgaagggtg 


gggctgccat 


gaagacctat 


gacatgccct 


ggagacattt 


tccccattgt 


7500 


cttggggatt 


aacactggct 


ccttgttact 


tatgcagatt 


tctgcagcca 


gctgaatttc 


7560 


tcctcaaaaa 


atgggttttt 


cttttctact 


gcattgtcag 


gctgcaaatt 


ttctgaactt 


7620 


ttatgctgtt 


tcccttttaa 


aatgcgatgc 


tctaacaaca 


cccgtcacct 


cttgaatgct 


7680 


ttgctgctta 


gaaatttctt 


ctgtcagata 


ccctaaatca 


tctctctcaa 


gttcagagtt 


7740 


ccacaaatct 


ctagggcagg 


ggcaaaatgc 


caccagtctc 


tttgctaaaa 


cataacaaga 


7800 


gtcgcctttg 


ctccagttct 


cagcaagttc 


ctcatctcca 


tccgagacaa 


cctcagcctg 


7860 


gtccttattg 


tttatatcac 


tataaaaatt 


tttgtcaaag 


ccattcaaca 


agtctctact 


7920 


ccaaactttc 


ccacattttc 


ctgtcttctt 


ctgagccctc 


caaattgttc 


cagcctctgc 


7980 


ctgatacaca 


gtcccaaagt 


tacttccaca 


tttttggata 


tcttttcagc 


aatgccccgc 


8040 


tctactggta 


ccaacttact 


ttgttagtcc 


gttttcacac 


tgttgataaa 


gacataccca 


8100 


agactggaaa 


gaaaaaaagg 


tttaattgga 


cttacagttc 


cacatggcta 


gggaggcttc 


8160 


acaatcatgg 


caggaggcaa 


aaggcatttc 


ttacatgatg 


gcagcaagag 


aaaatgagga 


8220 


agatgcaaac 


gcagaaatcc 


ctgataaaac 


catcggacct 


tgtaagactt 


attcactacc 


8280 


actaggacag 


tatgggtgat 


accaccccca 


tgattcaaat 


gatctccaac 


caggtgcctc 


8340 


ccacaacaca 


tgggaattat 


gggaatacaa 


ttcaagatga 


gatttgggta 


gggacacaga 


8400 


gccaaactat 


atcacatgga 


tttcttatac 


ttttgctttt 


aataacacaa 


acaaaaaaat 


8460 


acatcattaa 


aaggttagaa 


gtgagaaggt 


gtttttatgg 


aaatcaaaaa 


taatatcacc 


8520 


ttagtgaaca 


gtattcttat 


gattgtagtt 


gaattagaga 


gcagaataca 


tctagaagat 


8580 


tcagtagtaa 


gcatgtttct 


tcgattaatg 


gaaaatttga 


atagcctagc 


tgattgagat 


8640 
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tgaggttact 
gttttacatg 
gcatgactcg 
aaaattacct 
aataattgtt 
gtccttgaat 
agtgaatcca 
atctaaaaac 
tatgtgtatc 
tatcatgtaa 
atgcaactgc 
tccttttctt 
tctttaactt 
ttttaattgt 
cacaatgctg 
tcaaataatt 
tacatttatg 
taatacataa 
ttttattatt 
tcttcattat 
tctcctaatc 
actgaattac 
tattacttgt 
ttgggatatt 
atggacaagt 
ggattttcta 
tgtttgcaaa 
ttattgcttt 
agttctcttc 



attaaatgcc 
tacatgtgta 
aatacctgcc 
acaaatggca 
tagttaatat 
gtataacaca 
gaactacaaa 
taaaaggcat 
caaacaataa 
aatttgtttt 
taccacagtc 
aaatgttgca 
aatgcgcttg 
taagaaaaaa 
agtgtttgtt 
ttaaaatatt 
acagtatggg 
aatattcttt 
gtggttgaat 
ttactgtatg 
tataaagtgg 
tatagaaaat 
ttattatcta 
tgaaactttt 
tggtattatt 
tcacatttta 
atgtaaaaaa 
gtgatttgaa 
ttttactcat 



tgaagtataa 
agtaggactg 
atatgaagtg 
ttttcctcaa 
ttcagaatcc 
tttgttttca 
ttaactaaga 
gcctcaacaa 
tcttccagca 
attattgttc 
ttcctgatca 
taaataaaac 
aaattaatct 
tgtatgctgg 
taggatagag 
tgatacatat 
ataaatgtgt 
gctaatttga 
gactgtactg 
tgaaaattta 
ggataattgt 
gcttagaatg 
acacgtttaa 
cttagtgaat 
cttactggtt 
gcaaactttt 
aacaaaaggt 
aattcttgtc 
tctgaagtca 



gagttggttg 
ttgagcccca 
ctattacatc 
atcaatttta 
ctcatcataa 
aacaagcctg 
ttggccccat 
ttattttctt 
gtttcgctag 
agttctgaat 
ctgatctgtt 
cataccttat 
atgtcatttc 
gataaatatc 
tcctagaaat 
tgccaaataa 
ctttcttata 
tgggacagaa 
tacagccaga 
ggtgagctat 
accaatcata 
gtatctagtc 
ttattaatga 
tttccaataa 
tgtttcaggt 
tctgcatttt 
ttcttcatca 
ctgagaacca 
ttggaattga 



tttatgtaaa 
gtaacatgaa 
aaaaaagagg 
aatcttcaga 
aaagcaggca 
cctctaactg 
cgagttactg 
cttggaatca 
ctacattttt 
tttgacatat 
ctaaatctct 
gtggcctttt 
atgtatcaat 
tttctaaatg 
ggtatcacta 
tctcaaattt 
ccaactgaca 
atgttatatc 
gatatttggt 
ttaatctctt 
ttaggttcct 
accaggaagg 
agctcagttt 
aattatttgt 
tcagttagta 
atcttttttc 
agttggtatc 
aaatatatat 
atttggcata 



caaaatatct 
atatcaaaga 
cgtgtgctga 
atttcatttt 
aaaggcaaaa 
tgaatccagg 
aacgttaaaa 
ttaattaacc 
aattacttaa 
gcatcaagcc 
atagcatttt 
gaatctggca 
ggctcaatct 
agtttttgtt 
ggtcaaacat 
tttaccaata 
acattaatga 
cttattagca 
tcaaaatcca 
gatgccttag 
gtgagaatta 
actctctctg 
cgttatatgc 
ctatttttct 
agaattttaa 
tttcagataa 
tttatctttt 
ttgatgaaat 
tgatataaat 



8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
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cctaatttta 


tattttatga 


tattcaaaat 


ttctaacaaa 


tatttactta 


ataatctaat 


10440 


ccaggtttct 


attgtttctt 


ctgtttcctt 


tataatgctt 


tttctgaagt 


tatttttcct 


10500 


agacttaaat 


attagtataa 


tattatcata 


gaggaaaaaa 


tatctgttag 


ctatgaataa 


10560 


aaggctttca 


tcttattgtt 


gcattaatat 


atttaaatgt 


agagagcata 


cagattagca 


10620 


aagaaaaagt 


ataattgcct 


ttttttatag 


ttgacatgaa 


catgtataaa 


gaaaaaccaa 


10680 


aaaaatcaat 


aaaacaacta 


gaacttatta 


gtgaatttag 


caagatcata 


gcatacaaag 


10740 


ccaagattca 


aaattccatt 


ttatttatct 


actaacaaaa 


aatatttgaa 


atttgaaaat 


10800 


ttaaatatgc 


catttacaat 


aacatcaaaa 


tattgaacaa 


taaagtattt 


aggaatttat 


10860 


aaaatgaaat 


ctcctatacc 


aggaattaca 


gaccattgct 


gaaataaatg 


aaagaagacc 


10920 


aatatatgtg 


aagagatact 


catttgtgga 


ttgagagaca 


atattgttaa 


agtatcagta 


10980 


tttcccaaat 


taatcaatag 


attcaatata 


atggtgaaca 


gaacaccaga 


agatgttctg 


11040 


tcgaagctga 


caagctattt 


ctataattca 


aatggaaatg 


caaaaggcag 


tcactgccaa 


11100 


caccagcatg 


gactgtctgg 


gttccagtag 


gttacttcac 


tactgcctct 


tctgtcagcc 


11160 


acatcacgac 


agctgcccag 


aagccagaga 


aactcctcac 


acctggccca 


ctgctgcagc 


11220 


taccagcatc 


caggcaagcc 


accatcagcc 


cactggtaac 


tgccaacaga 


ggtaccactg 


11280 


tacactaccc 


tggggaacaa 


agataggcat 


gtagtcagcc 


cacctctgcc 


accactaggg 


11340 


cctgaagcct 


ggcccacctg 


acactgcagt 


cctcagcaca 


gcttcatcac 


agcttctgtt 


11400 


aataaccaca 


ccctaaccta 


ccaaggaaat 


cacaaatgtc 


actgacactg 


tttgtagcca 


11460 


aagaaatcat 


agagagacta 


cattactgca 


cacacccata 


atcaaagcca 


cagtacccta 


11520 


tccagacaac 


atcacaggta 


tatctaaagg 


aaaaaatttt 


cccatatgaa 


agcgaattca 


11580 


aatataggaa 


gaagcgactg 


ttacaacaga 


tatgcagata 


aagcttcaac 


aatatcctac 


11640 


attcaaccag 


aagaaagaat 


ctcagaaggt 


aaagacaggt 


cttctgaaat 


aatctagtca 


11700 


gacaaaatta 


aaagagaata 


atcaaatcct 


tcctgacatt 


tgggataaca 


ttaaagtgac 


11760 


caaatatacg 


aattatagat 


acccctgaga 


gtgaaaagac 


aaagaaaaga 


ttagaaaacc 


11820 


cacttaatta 


aataatatat 


gaaaacttcc 


taagtctagc 


aagagtttta 


gatatttggg 


11880 


atgcaggagg 


ctcaatggtc 


cccaggccga 


taaaacgcaa 


aaaggtctta 


tacacagcac 


11940 


attacaatca 


gactgtttaa 


agtcaaagat 


aaggaataaa 


ttctaaaaac 


agcaagagaa 


12000 


agtgtatgat 


aacctatgaa 


gtaaacctta 


tcagactgac 


agcaaatttc 


tggcagaaac 


12060 
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tttacaggcc 
cagccttaaa 
ttcccagaca 
ctcaagggag 
gaaagtataa 
aatctaacta 
acaacaaata 
aatgaattac 
caaatatatg 
aatggtaaaa 
aattagatca 
tatataatga 
aacactagag 
aatacagtaa 
caaaaaatca 
gacatttaca 
aaacaatgtc 
taaaagttga 
atgccaagaa 
ttgatcactg 
agattgaaac 
gtttatagca 
caatgtgcct 
acacaaaaat 
ttaacaaaac 
gactaaacaa 
acaacagatg 
gaaaaatgga 
ttgcatcagg 



agaaagaata 
tactatagcc 
cgaaaatgct 
gtctggaaac 
aactccctgg 
accacaatga 
aacaacaata 
attctccacc 
ctgattacaa 
gatattcctt 
tacagacttt 
tgagattaac 
aactcagatc 
tagtggagaa 
accagtaaat 
aaacattcca 
caagatagac 
aatcatatca 
taacgttgga 
ggacaataag 
ataacatatc 
ataaatgctt 
gaagaaacta 
aaagatcaga 
taaaatttgg 
ggaaaaagaa 
ccacagaaat 
aaatatagag 
aagaaataga 



ggacaatata 
cacaaaatta 
gaggtagttt 
tggtagtgaa 
taagcaacta 
caaacaataa 
taacaggaag 
taaacgttat 
gaaacttacc 
gcaaatggaa 
aagtcaaaaa 
ccagcaatgg 
cacaaagcaa 
cttcaacact 
tttagattta 
tccaaccact 
caccatatgt 
catatcttct 
aactatacaa 
gaaattaagc 
aaaaccagtg 
acactgaaaa 
aaaaatcaag 
aaagaactaa 
ttattcaaca 
tatccaaata 
aaaaaggatc 
aaatagataa 
aaacctgaac 



ttcaaagtgc 
tccttcataa 
gttactacta 
aggacgacat 
aagggaggta 
gggaaaaaga 
cctcacatat 
gaaatgcctg 
aggcagacat 
agcaatagtg 
gagtaaaata 
gaaataacaa 
atattagacc 
ccactttcag 
aactagattt 
gcaaaatgaa 
taggccacaa 
cagaccactg 
atacatgcag 
tgaaaatcaa 
gcatacagca 
agtagaaata 
aacaaatcaa 
atcaaataga 
agataaataa 
aacacaatca 
atcagagact 
attcctagaa 
atatcagtaa 



ttaaagaaaa 
atgaaggaga 
gactggacct 
ttatcatcat 
tcaaatgtta 
aaggaacaaa 
cagtaatcac 
aatgataaaa 
acataggctg 
agcaggagta 
aaaaagacaa 
ctctaaatgt 
taaagagaga 
tattagacag 
tagaccaaat 
atttgtgtca 
atcatgtctc 
ttgaataatg 
attaaacaac 
aaaattcttg 
aaagcagtgc 
ttttaaaatt 
acccaaaatc 
gactaaaaaa 
aattgataaa 
aaaacgataa 
attattaaca 
acttacaacc 
tgattagcaa 



aaaaaactat 
aataaaaggt 
acaataaatg 
gaaaatacat 
ccaccagaga 
aatatataag 
tttgaatgta 
ctatatgatc 
aaagtaaaag 
gctatactta 
aggatgttat 
atatgcattc 
aatagactgc 
ataatctagg 
ggacctaaca 
tcagcacatg 
agcaattttt 
ctagaaatca 
atgttcctgg 
taacaaataa 
taagagggaa 
agcaacctaa 
agcagaagaa 
atacaaatga 
ccgctagata 
aggagacatt 
actatatgct 
taccaagctg 
aattgaatca 



12120 
12180 
12240 
12300 
12360 
12420 
12480 
12540 
12600 
12660 
12720 
12780 
12840 
12900 
12960 
13020 
13080 
13140 
13200 
13260 
13320 
13380 
13440 
13500 
13560 
13620 
13680 
13740 
13800 
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gtaataaaaa 
ttctaccaat 
gaaggaattc 
aaggacacca 
cattaacaaa 
tccagagggt 
ataagcatct 
aaaaattttt 
aagaaacata 
agaatggggc 
tctcaccact 
agaaataaaa 
atggttttgt 
taatttagta 
cccaataata 
aaaaattaaa 
aacactgatg 
gatcaaaaga 
tgcaatttct 
gtatggaatg 
ggtcctgact 
aatttgtaca 
cagcatggta 
atatattgtg 
aatgttttct 
atataggtta 
tgcagcaaca 
agataaatat 



acatctccca 
catgcaaaga 
tctctggctc 
caaaaagaaa 
atactggcaa 
ttgtatcaag 
tcacagaata 
gataaatttc 
cctcaatata 
aaagtttaaa 
cctatccaca 
agtatctaaa 
atctggaaaa 
aagtttcagg 
aaatagctag 
atacctggga 
aaagaaatta 
attaatatca 
accaaattac 
agaacagagc 
taatcactat 
aataaaaaaa 
tttttagaca 
tgtgtgtgtg 
acatgttcta 
tatagaattg 
tggttgaaac 
cacatgttct 



actcttttaa 
agaataccag 
attctacatg 
actacaggcc 
acggaatcca 
gatacaagta 
taagacaaat 
aacatctctt 
ataaaggcca 
gactttcctc 
tagtactaga 
ttgagaagag 
tactaaaaac 
atacaaaata 
gaaagaaatc 
ataaatcaag 
aggattaaac 
ttaaaatgac 
caatgtcata 
ccaaatagcc 
gcaatctatg 
tgtaaaaaaa 
aatggaatgg 
tgtgtataca 
atatttatat 
gaagactatc 
tggagttcat 
cacttatatg 



agctttggac 
tcttcttgat 
accagtgtca 
aataaccatg 
acagcacatc 
tgactcaatg 
gaatatatga 
catgaaaaaa 
tatgtgacaa 
taagaactgg 
agtcctagcc 
caagtaacat 
tccagcaaaa 
aaaatacaaa 
aagaaagtga 
gaagttaaag 
aaacaaattg 
catacttccc 
tttcatagaa 
aaagcaattc 
catgtaacaa 
gaacaaagct 
aatagaaagc 
cacatacatg 
tccattccat 
tgccattaaa 
tatcttaagt 
tgggagctaa 



caaatagcat 
gctattacaa 
ccttgaaacc 
atgaacacag 
aaaaaaataa 
taaataaatc 
tcatctcaat 
atctctaaaa 
actcagagct 
aacaagacaa 
aaaacaatca 
tgttcctctt 
acctcttaga 
agtcagtagc 
tcccatttaa 
atctctgcac 
agaaacatcc 
aaagcaattt 
ttagaataat 
tgaacataaa 
aattgaacat 
ggaggctata 
tcagaaataa 
tatatataat 
tatacatatt 
aagaatgaaa 
gaaataatct 
taacttgatt 



cacagcctaa 
taaatcagag 
aaaacctgac 
atgcaaaaat 
tataccacaa 
aataaacatg 
agatgcagaa 
ctcagcatag 
aatatcatac 
ggatgcaaac 
gacaagcaaa 
tgctgatgat 
tttgattaat 
atttctatgc 
attagctaca 
aaaactacaa 
catgtttatg 
ccacattcaa 
cctaaaatta 
gaacaaatct 
ggattttatc 
gtagccaaaa 
agccatatat 
gtgtacatat 
ccatttctgt 
tcctgtgatt 
aggcacaaaa 
acatgaaggt 



13860 
13920 
13980 
14040 
14100 
14160 
14220 
14280 
14340 
14400 
14460 
14520 
14580 
14640 
14700 
14760 
14820 
14880 
14940 
15000 
15060 
15120 
15180 
15240 
15300 
15360 
15420 
15480 
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ggagaatgga 
aaggtgaaga 
cttgatagca 
cctaaatact 
cgtgtaaaat 
gacaatgtta 
tttatgttaa 
ctatcaaaac 
tacatgtcac 
cgttagtcat 
acagagactg 
tgtggtaaga 
tcacccaaaa 
tctcttgata 
tgtgtatatt 
cattgttcaa 
tataaatgta 
aggaattaaa 
caggctatta 
acttgaaact 
ttgctagata 
ttctcatttg 
tgttcgtaag 
cattctatta 
gaaagaaaga 
ataatcatgg 
gtaaaagggg 
gaacacaatt 
acacatgggg 



aaggtaggta 
gaagagagtt 
gagtacagtg 
tgatcactat 
taaatgcgta 
actcaggtgg 
tgccatatag 
tcaataatat 
caaaaaatat 
cagagaaata 
acaataccaa 
atgtacaatt 
tatatgtctt 
tatgtctttt 
tgtgttttta 
catgtacaaa 
tttattggtg 
ctatagtaga 
gcgaatcata 
tgaggccaga 
atagaaaaga 
tgacctcttg 
agagtctggg 
gtctgttctt 
ggtttaattg 
gggaagggga 
gaaaagcccc 
agagtaactg 
cttatgggaa 



ggaaacagag 
aaaaggtgta 
actacagtta 
gcaattatat 
caaataaaaa 
ataggcatta 
aatgaaataa 
taagacaacc 
attcaaattt 
caaaataaaa 
gtattgacaa 
atataaccac 
ttggaaaaaa 
caaagaggct 
taatttatat 
tataagaaag 
ttccacgttg 
aggttctcat 
gcactagggc 
tatctgccca 
atcaaatgac 
cctataaatg 
aatgtgagct 
aaactgctaa 
actcacagtt 
agcacacatg 
ttataaaact 
cccccatgac 
ctacaattca 



actggaaagg 
aacatatagt 
acaaaatgta 
acgtgtaaca 
taataaaata 
agtcaatact 
aattcactaa 
caataaaaat 
ccaataaata 
tggtaatgag 
ggatatggag 
attgaaaaaa 
ttttttccag 
gagctttact 
gcatatattc 
atatagtaaa 
caaactaaat 
ttattgggat 
ttcactgcta 
tgctgataga 
tctgccacat 
atagatagtc 
ttttatatcc 
tgaagacata 
caacatgact 
tccttcacat 
atcaaatccc 
tcaattactt 
agatgagatt 



atgaatggag 
taaaagaaat 
ttatactcag 
aaatcactat 
ctaatccagt 
actataagaa 
aatccaaaaa 
gtggtcaaag 
catgtaacaa 
atactactag 
caactgaaat 
caagttttca 
tctgtgggtt 
ttagacagtg 
ctgtgaaaag 
gaaatatata 
aatctacgtt 
gattagaacc 
cctccactga 
aaacaactga 
tgcttgccag 
cctgtgctgc 
tatttttggg 
ccccaaattg 

ggggaggcct 

ggtagcagga 
atgagaactc 
cccaccaggt 
tgggtgggga 



ggtaggaggg 
aaattcaatg 
gtgatgaaca 
gcactatata 
atcattcact 
ccacttcttg 
attagaaaaa 
gatttgaaca 
tgttcgacat 
ataggctttt 
tctcattcct 
gtttctttat 
gtcttctcat 
gtcatcaaag 
atactgtatg 
tttctaaatt 
ggctaattta 
agcctttttg 
cacctctgac 
ataatttaat 
aagattgttt 
atgctatagg 
tggtaaaggt 
ggtactttat 
aaggaaagtt 
aggataatga 
actctcacaa 
ccctcccaca 
cacagccaca 



15540 
15600 
15660 
15720 
15780 
15840 
15900 
15960 
16020 
16080 
16140 
16200 
16260 
16320 
16380 
16440 
16500 
16560 
16620 
16680 
16740 
16800 
16860 
16920 
16980 
17040 
17100 
17160 
17220 
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ccatttcatt 
atgcccttcc 
agtccaaagt 
aagcaagtta 
tccaaacggg 
caatagggcc 
cgtctaggtc 
tggctttgca 
cagcttttcc 
gatggtggcc 
ggctctgatc 
acccctgcag 
ggcggaggtt 
ggaagctggc 
cccttattag 
cagcaggggg 
tgcgatgagt 
tcttattaag 
cctcagaaaa 
tatgctctgc 
taaaactgaa 
tatcttctac 
ggcaggggca 
agttcccaac 
tatcattatc 
cccacatctt 
agttgcaaag 
caatttacca 



ccacctctga 
aacagtcccc 
ctaatctgag 
gctacttcct 
aggaattgac 
attgttaaac 
atgattatgc 
gggtacagcc 
aggcacatgg 
ttcatctcac 
ccacatttcc 
caaacttctg 
tccagacctc 
aaagcttggg 
ttaaagctgg 
gccctggacc 
agggttgcca 
atttggctca 
tgagtttttc 
ttccctttta 
tgcttataac 
cagataccct 
aaatgccacc 
aagttcctca 
agcatgttgg 
cctatctttt 
ttactgccac 
tatccattta 



cccctcccaa 
ccaaagtctt 
acaaggcaag 
agatacaata 
caaaaccaag 
cttaaagttt 
aagaggtggg 
ccactccagg 
gtgcaagctg 
agatccacta 
cttccacact 
cctgaacatc 
aattgttgac 
gctttcacct 
agcagctggg 
cagcccacaa 
caaaactgtc 
tagttactta 
ttttctatgg 
caattaagtt 
agcacccaaa 
aaattatcgc 
agtctctttg 
tctccatctg 
tcaaagccat 
tctgaggcct 
atttctgggt 
ttctcatgct 



atctcgtgtt 
aacacatttc 
tcccttctgc 
gggtcacagt 
gggctacagg 
caaaattatc 
ctcccacagc 
ctgcttttac 
taggtggatc 
ggcagtaccc 
gccctagcag 
caagcatttc 
ttctctgcaa 
tctgaagcca 
ttgcagggca 
aaccaatttt 
tgacatgcct 
tgcaaatttc 
catcatcagg 
ccaattccaa 
tcatatcctg 
tctcaagttc 
ctaaagcata 
agaccacctt 
tcaacaagtc 
ccaaactgtt 
atctttacag 
gataataaag 



ctcacaattc 
agtattaaca 
ctatgagcct 
cattgggtaa 
cctcatggag 
tcctttgact 
tttgggcagc 
aagctagtgt 
taccattctg 
cagtggggac 
aggttcacca 
cttacatcct 
atgtaggctc 
tggccttagc 
ccaagtccct 
ccctcctagg 
tggagacatt 
tgcagcaggc 
ttgcaaat tt 
accatatctt 
aacactttgc 
aaagtaccac 
acaagagtca 
agcctggatt 
tctaggaagt 
ccaacttctg 
cagtgcccca 
acatacccaa 



aaatacaatc 
caaaagtcca 
gtaaattcga 
atacacacat 
gtccaaaatc 
tcatatctca 
tctgcctctg 
tgagtgcctg 
tggtctggag 
tctgtgtggg 
tgagggctcc 
ctggaatcta 
aacaccccat 
tgtaccttgg 
atggtgcata 
cttctgggcc 
ttccctattg 
ttgaatttct 
ttaaaacttt 
tctggataca 
ttctcagaaa 
agatctctag 
cctttgctcc 
tcattgtcca 
ttcaaacttt 
cctgttaccc 
ctcctggtac 
ggctgggtag 



17280 
17340 
17400 
17460 
17520 
17580 
17640 
17700 
17760 
17820 
17880 
17940 
18000 
18060 
18120 
18180 
18240 
18300 
18360 
18420 
18480 
18540 
18600 
18660 
18720 
18780 
18840 
18900 
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tttataaaga 
gaaacagaat 
agaatgagca 
ctcttgagaa 
tcccatgaca 
cacagccaaa 
ttttaaaaga 
ttttgctctt 
attatgatgc 
aatcagcctt 
agttgacttc 
tgaagtacat 
tcaccatttc 
ttgcaggagc 
aatgtgttta 
agcagtgccc 
agactatata 
tttcattgtg 
taatctaatg 
cctatttaat 
acacttcagt 
aataacatat 
accatcgtgt 
aacaggctat 
gtacactcta 
tgtccctgtc 
atattcagaa 
gcacgggcat 
gagcaaggtt 



aaaaagaggt 
catggtggaa 
aaacggggga 
cagcatgagg 
cgtgaagatt 
ccatgtcagt 
tttgagcaac 
attaaaaata 
attatgctct 
aacaataatt 
atatgtgatc 
atatgatgta 
tttgtagttt 
cttttaaatc 
ttcagtgtgc 
cactcctggc 
tcacaataaa 
gagttctcat 
cattcactaa 
ttatccttct 
catttggtca 
ttttcctgta 
tagagttgcc 
accacatacc 
tgatgttctc 
attaatacag 
ttttagctat 
ttcattttag 
atgacatggc 



ttaattgact 
gggaagcaaa 
aaaaccctta 
gtaaccatgt 
atgggaacta 
catgatatga 
cacaaatgac 
taattgttct 
cagcagctaa 
ctataaattt 
taaatacatt 
gctcttaaag 
gtgctttcat 
catgtgtcca 
atggattaat 
accaatttac 
ctgagaacca 
tttccttagc 
atgaaggtgc 
ttagattaaa 
ttgaaggacc 
cattttcatt 
tgcagtattc 
ttaggtgtat 
acaatgaaca 
tatgtaacaa 
ttctcttata 
taaccacagt 
cagatgtgaa 



cacagttcag 
cacatcctcc 
taaaatcatc 
ccatgattcc 
ctacaattca 
gaaattatca 
agatatgtgc 
tgataatctg 
aacttcaagc 
aattttcttt 
accattattt 
tacatataga 
gatgaatatt 
ttttatgaga 
ttgagagagc 
tgtattagtt 
gctggtaaat 
ttatgctgct 
tgtgttagcc 
aataaataag 
acatctatta 
gttctgatat 
agtacagtaa 
agttaggtta 
aaatcaccta 
tacagttagt 
tttcaaatgg 
ctgggaaagg 
fcttttgatca 



catggttggc 
ttcacatggt 
agatctcatg 
attacctccc 
agaggagatt 
aattaagatg 
tatagtagtg 
aattataaat 
aaaatacaca 
atttctgata 
tggacttatg 
agagcagata 
ctcatcaatg 
cttagctttt 
acaggtatgg 
tattctcatg 
gagagaactg 
tattcaacac 
tccacatgat 
tagtcatgtg 
ctgtggtcca 
gttttgatac 
catgctgtac 
taccatctag 
atgatgcatt 
acaatatgta 
attttcttat 
agaagtcttt 
gtgactccat 



aaggcctcag 
ggcagggaga 
agaactcact 
aacgggttcc 
tgggtgggga 
tagggaaggt 
caaaatacca 
gtcatggata 
cctagagagc 
attacatttt 
atgtagctct 
aagtatcagt 
tacagattat 
gtctgtatat 
gtatctttac 
ctactaataa 
tggtccacct 
tatttctgca 
attaatacag 
ccacagaatg 
ataagattat 
ataaatgctt 
acctaggagc 
gtttgtataa 
tctcaaaaca 
atacatgact 
gcactgtgtg 
gaaggatgtt 
gttagcagat 



18960 
19020 
19080 
19140 
19200 
19260 
19320 
19380 
19440 
19500 
19560 
19620 
19680 
19740 
19800 
19860 
19920 
19980 
20040 
20100 
20160 
20220 
20280 
20340 
20400 
20460 
20520 
20580 
20640 
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aaagttgtat 
catggatgga 
cttggtgatc 
ttcctaagct 
tttgattttt 
aaggataaaa 
gtgatcttat 
tctttccaaa 
tttcccaaaa 
aataaattgt 
ttcactgctc 
ctcagtgaca 
tcttactcca 
ctttttccag 
agcacaagcc 
tccctttgcc 
cccttaggag 
atatatatat 
atgtaatata 
ttgttaatat 
tatctattta 
tagatatata 
aattactata 
tataatgtat 
ttcaattgat 
cgatatgttt 
attaatttta 
atgaaaaatg 



tgggaaagat 
aatgtgaggg 
aaagaataat 
aggaggaaga 
agaacaaaga 
aagtcagctg 
tcattccaca 
tatcagataa 
tgtgaaataa 
gaatgttgca 
ttgcatgtct 
cccttgataa 
acatcaccgt 
tgtttctgcc 
cttaatccaa 
actcctgaac 
gagcaacaat 
atatatatat 
tgtgccatat 
atagagtata 
tataatagat 
ttttatatat 
tttgtataga 
aattctatat 
tacaatctaa 
gtttaaaaat 
tatgatctct 
gacattacat 



caaaagcatg 
atggcaggag 
gagactcatc 
atgaggaatg 
ataaatttgt 
taatattatt 
cacattcttc 
ggttattctt 
gtgacttttc 
tggaaatgta 
tgcctttagc 
catcagcact 
cccctctagg 
agctttttcc 
aattattaca 
cctgaagcat 
gtgcactgca 
atatatatat 
agcctggtgg 
taaatatcta 
atttatatat 
tatataaata 
taactacacc 
taatataata 
ttcagaaaga 
ccagcaatta 
gaaatatatc 
cattaagttc 



aaggccagat 
agatgctatg 
caataagact 
attttctggt 
acatgcttta 
ctttcctgat 
ttgaacctgg 
acgtagaccc 
ataagatatt 
ggtgacttgc 
tgggatgaca 
gatgacctga 
aagaatactc 
ccaaaattac 
gaaactggaa 
ctttcatctt 
gccaattttg 
atatatgata 
tatagttatc 
tttatataat 
attaaatata 
tatatttata 
accaagctat 
gtaacatatc 
tttatgttgc 
ttttcatagt 
actggatctg 
tagcttgtct 



aagaggatac 
attgaatgaa 
ctgtgaatga 
tcctgactac 
tgattcctgg 
accatgcagt 
acactgctct 
tcagttcata 
ttttaaaaga 
attgtgcatc 
gcagttcagt 
acaccacatc 
aggtgagaat 
ttaatattag 
aatgcagaga 
agtctttcct 
aataaacaga 
tacattacat 
tatacaaata 
agatatttat 
taaaaatata 
tatttaatat 
atggtgtgta 
aatacttaat 
catatctctc 
ctaattttag 
ttgtgaatga 
tactacttct 



tgtatgttat 
tctcaatatt 
ttgaatgtag 
agcacaagtt 
ttgaattttt 
atttgtatca 
agacactgat 
taaatatgat 
atgtcttaat 
ctgtgtttga 
gagcagtggt 
ctctgtcagc 
taccaccttt 
attaaggtat 
taataaggac 
aaagccacaa 
agcagcttat 
atttatatat 
tatttattta 
atatattaaa 
taacatataa 
attaatgatg 
tatattaata 
ataatatata 
cttacaatat 
atagttcttg 
taaatcaaaa 
tatgacattt 



20700 
20760 
20820 
20880 
20940 
21000 
21060 
21120 
21180 
21240 
21300 
21360 
21420 
21480 
21540 
21600 
21660 
21720 
21780 
21840 
21900 
21960 
22020 
22080 
22140 
22200 
22260 
22320 
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06G 


gatatagaaa 


atttctacct 


ttctgtagcg 


tttaattggt 


gttttctgca 


tgtatttatt 


22380 


ctgaaattct 


ctaatatctg 


caagtgggaa 


ttatgtggct 


aaaattaata 


aaatgtaagt 


22440 


gaaggtaaat 


caaaatagaa 


tctttggatt 


tatccagtta 


tctgaaagta 


catttcattg 


22500 


ccttaattca 


cactttataa 


atttttctac 


ataaagtttt 


tctgtaatat 


ttgtctttat 


22560 


agctgaggac 


agattcagag 


aaacgctcca 


ccacagacga 


gacctgggat 


agtcctgagg 


22620 


aactgaaaaa 


accagaagaa 


gattttgaca 


gccatgggga 


tgctggtggc 


aagtggaaga 


22680 


ctgtgtcctc 


tggacttcct 


gaagaccccg 


agaaggcagg 


gcagaaagct 


tccctgtctg 


22740 


tttcacagac 


aggttcctgg 


agaagaggca 


tgtctgccca 


aggaggggcg 


ccatctaggc 


22800 


agaaagctgg 


aacaagtgca 


ctcaaaacac 


ccggtaggct 


tgtcgtttgc 


cagctgttat 


22860 


gcaaaagtgc 


tttactttat 


tgtttccatt 


caatctttgt 


tttctctaac 


aatagcattt 


22920 


ctaaaatacc 


aaattcttat 


ccatattaaa 


catggagtca 


aatagttaaa 


tagtttttct 


22980 


gtctacgttt 


cacaaactcg 


tcatagaagc 


ccaagtaggg 


cctatatcta 


ggcattctct 


23040 


ggaaagcctc 


ctcataaact 


aggggtactg 


gatgccttac 


cttgccagag 


ttatttcagg 


23100 


taatggggaa 


ataagattag 


gttgctaaag 


caacagttaa 


gtttttttgt 


ttttgttctg 


23160 


cgttcttaat 


gaaagtttgg 


aatttttaca 


ctaaatatgc 


cactgaattg 


cactacagac 


23220 


tctgagagga 


acaagcaatg 


acactaatca 


attggaatgc 


tggagatttg 


aaatattgtc 


23280 


tgtgtattag 


acttcatgaa 


agaagagaat 


gaaatagttc 


ttcaaaattg 


tgccatactt 


23340 


tttttaaaaa 


gactctcccc 


gtatttttaa 


aataatgcct 


aattataaat 


agtgccacct 


23400 


gaagcactaa 


ttaacagggt 


actccaaata 


taatcatctc 


acagatattc 


aaatgaattc 


23460 


tttttctagt 


aattagcttg 


atagggttaa 


gtgttacctt 


tttaaaaaga 


gttgcaaaat 


23520 


ataagacatt 


aacaaatagc 


aaaacatatg 


ttttcatttt 


atctcttcca 


tctctcataa 


23580 


tgtttcttct 


gacagccaaa 


tttttgtagc 


tatgcactca 


gtcctctcaa 


tatatgagat 


23640 


ttttgatcta 


agccaataca 


tttaggaagg 


gaaataatat 


aaagaagcat 


tcacatttta 


23700 


cacattgttt 


cacgaagtgt 


ggtgatatca 


aactctacag 


gcacatatat 


ttgtgtattt 


23760 


ctccttaatt 


agggaaaacc 


gatgatgcca 


aagcttctga 


gaaaggaaaa 


gctcccctaa 


23820 


aaggatcatc 


tctacaaaga 


tctccttcag 


atgcaggaaa 


aagcagtgga 


gatgaaggga 


23880 


aaaagccccc 


ctcaggcatt 


ggaagatcga 


ctgccaccag 


ctcctttggc 


tttaagaaac 


23940 


caagtggagt 


agggtcatct 


gccatgatca 


ccagcagtgg 


agcaaccata 


acaagtggct 


24000 


ctgcaacact 


gggtaaaatt 


ccaaaatctg 


ctgccattgg 


cgggaagtca 


aatgcaggga 


24060 
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gaaaaaccag 


tttggacggt 


tcacagaatc 


aggatgatgt 


tgtgctgcat 


gttagctcaa 


24120 


agactaccct 


acaatatcgc 


agcttgcccc 


gcccttcaaa 


atccagcacc 


agtggcattc 


24180 


ctggccgagg 


aggccacaga 


tccagtacca 


gcagtattga 


ttccaacgtc 


agcagcaagt 


24240 


ctgctggggc 


caccacctcg 


aaactgagag 


aaccaactaa 


aattgggtca 


gggcgctcga 


24300 


gtcctgtcac 


cgtcaaccaa 


acagacaagg 


aaaaggaaaa 


agtagcagtc 


tcagattcag 


24360 


aaagtgtttc 


tttgtcaggt 


tcccccaaat 


ccagccccac 


ctctgccagc 


gcctgtggtg 


24420 


cacaaggtct 


caggcagcca 


ggatccaagt 


atccagatat 


tgcctcaccc 


acatttcgaa 


24480 


ggtaaggatg 


tataaaatga 


tgctggaaaa 


atataaagga 


taaatatgtg 


ttagacacat 


24540 


acattacata 


taaatgtgtg 


tatatatata 


ttttaaatat 


gtataaggta 


tataatatat 


24600 


atatcttaga 


attctttaaa 


gtacacagtg 


agctctatga 


agcttatcat 


ataaacagct 


24660 


agcaaaaaaa 


atagttctca 


ttttgagaaa 


cagtcaaact 


tcaaagtttc 


actgtcattg 


24720 


tgatactagc 


aacacaaaca 


tctaagagac 


ttaaaagctg 


atggtaatac 


ctaagtgtag 


24780 


tgataaggca 


aagtaatagc 


ttgtaaaatt 


tctatagatt 


tccattcctc 


cttttcacat 


24840 


taaaaattaa 


aaccaaatag 


gttttcatga 


cttttggcat 


tcatttccag 


tgtcattttc 


24900 


ttgctggctc 


ttaatgagtt 


ggtgatcata 


aatgtagatg 


aagttgtttt 


ccttgtaaca 


24960 


gattccattg 


gacagattta 


tacagtgtca 


tatcttgaca 


cattaaagac 


aatcaagata 


25020 


tgacataatt 


tgaaactatt 


ccagtgtttg 


gtacagtatc 


acaactgaag 


agtgggctaa 


25080 


gctttctaac 


tcttcatctg 


ctttctttga 


catgactctg 


gtaaggatca 


tgacttggtt 


25140 


tctgttcctg 


gattgttttt 


ggtgttaaat 


atgtgaagtt 


ctgctctaag 


atatcactgt 


25200 


ttttaaatac 


ccatgtgttt 


ttaagtggta 


ggaaaataaa 


tgcagttaaa 


aattggggac 


25260 


aaatatctaa 


acctctctga 


gtctgttttc 


tcatctgcaa 


aatggtagag 


tgtggtttat 


25320 


agttcattat 


gggttcaata 


tttttaatgt 


ttgtttttat 


tctgttgact 


aaacccagaa 


25380 


ctttgatatc 


ttggaaagga 


aagattttga 


aacatttatt 


ttacaataaa 


gcaatttcag 


25440 


atacctgatt 


gtttgaaaaa 


cctaaaggct 


ttattcctcc 


gtagtaatat 


taatgctgca 


25500 


gaactgtctt 


tttaaaatac 


tgattctcat 


tgggaagaat 


gaattatggc 


gtatagggag 


25560 


agtaaatatt 


tctgtttctt 


aagtaaaagc 


caatagtgcc 


ctcctgtggc 


ccattaccta 


25620 


tgaaacaatt 


tctcatattc 


gtcataaaat 


atttcactgt 


aggaaatatg 


gatttcattg 


25680 


caactcaatt 


agtaatcatt 


atgccattac 


ttcatatcat 


tgtatttcca 


tatttacata 


25740 
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aatttgattc 
aaatctaaca 
ggcaactgac 
gccgacaata 
atactcaaca 
tttaaaaatt 
gcgtagaagt 
tactgacttt 
tttgacttac 
ctacccttat 
ttaacataga 
tactgtaacc 
ttgtttttct 
tactgagggt 
aggcagtctg 
cggcagcagc 
aagtcactcg 
gtgggctgcc 
catgactagc 
cttgtctgga 
tgtgagatct 
ctcttccctg 
gccccggtgc 
aaatttacag 
atgcatacat 
tgaatgaggt 
tccttatgat 
agagcaatat 
tgggaaatcc 



taccatctgc 

gcaccagctc 

ataaaaccct 

tcaatattgt 

caactcccta 

taaaaataaa 

tgtttcatta 

gagtgaaagt 

ttttgttcca 

atgagagatt 

gacaggaagt 

cgaaatatta 

tttaggttgt 

gtgaaatctt 

gagtcaccgt 

agccctctct 

ttggcctcca 

aatatgagca 

ctccacacga 

ctgaccacag 

actctctcag 

cactatgcct 

aaaaagatgt 

aggaatagaa 

ttatacccag 

ctcctgaaga 

gacttaagca 

gttaaatttg 

tagttggcct 



ttcatttaca 
tgtttcaaat 
ttggcctctg 
ttccaaccaa 
aacttgcttt 
aaaaaaagag 
taatggttct 
ctcatgatca 
cccacttccc 
ttctgcactt 
gctttgtagt 
aatgtggata 
ttggtgccaa 
cctcagtaat 
cgtccggtac 
tcaataaacc 
gcccagcatc 
gttcctctgc 
gctctgagtc 
gcactcacga 
aaaggtgagc 
aacccccacc 
aagactgatg 
tcattaaagg 
taggcaatgt 
cattttaatg 
ttttgaaaga 
ttccatttgt 
ggttaaaaaa 



aaactaaaat 
cactattaaa 
ctggggaaaa 
ctgctccctg 
aagcgttcca 
aagagctcat 
gtaaataggt 
cttaaattat 
ctctttccct 
gataaagaag 
tcagtacatc 
ttagcttctt 
ggcaggtggc 
gcccagccct 
gggcagcatg 
ctcagactta 
ggttcactct 
aggcagcaag 
cattgacctc 
ggtccagagc 
tttcctggag 
ccattaaatt 
aaaccgggcc 
tagggtgagt 
gaataaaatt 
atttggctta 
ccaaattgaa 
acttctatga 
aaaaaaaaaa 



gttttctgaa 
aaatgtattt 
tacagacaaa 
acagtgactc 
tctagatttt 
ttaagtgttg 
aacagcaagt 
gaaaaccagg 
agtagcagct 
tccaagctta 
aaagcacact 
ggaacaactg 
aaatctgcct 
agtaccacat 
ggcagtgctg 
actacagatg 
ttcacatcag 
gatactccga 
cccctcagcc 
ctgctcatga 
gcattgataa 
cccttgattt 
tttcatttgc 
ggataatttt 
caaggaatgt 
agcttcagaa 
attattctat 
aaaaatagca 
tcaattgtca 



ctaaactcca 
gaatagcact 
ctgacttgtt 
agaccaccag 
gaataaactg 
tctatcgaat 
atggtcaaac 
ggttttcatg 
cagtactgac 
taaaagttca 
tggctctgtg 
aagttgttat 
ctgcacctaa 
tagcgcggca 
gtgggctaag 
ttataagctt 
gtggtctcgt 
gctaccagtc 
atcatggctc 
gaacgggtag 
catcttcccc 
cactgtgagt 
tctcattacc 
gttaatatga 
atttagatat 
caacactagc 
agttatgctc 
gatggattgc 
gccatgaatc 



25800 
25860 
25920 
25980 
26040 
26100 
26160 
26220 
26280 
26340 
26400 
26460 
26520 
26580 
26640 
26700 
26760 
26820 
26880 
26940 
27000 
27060 
27120 
27180 
27240 
27300 
27360 
27420 
27480 
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attagagaaa attatagtgt cagtgccatt ttcaatagac tgcttaaaaa gtaatcatat 27540 

tacaaagtgt ttctcattgg ctttatatat atatataaac ttaaagtaga ggacatagca 27600 

aggcatttct tacctaatat gcttactgtg aagcatccct tttgagcaaa atcactctaa 27660 

attttctcct caaagtgatc ctctcttgat tatactgtac tgactcttac caccaggaaa 27720 

atgtcttaaa accacttctt tttcctgata aatgcaatgc tatttgtctc ttgacataag 27780 

taaagcttta aacatggtct tggccacatg tggaaagaaa tactggtcac gtaaaatacc 27840 

tgatatatct ttctatgtct tcccctgttt tttttatttt ttttttattt ttatttttta 27900 

actctgatat tgatgatggc atttattttc tagaccttca gccttactcc cggaatgata 27960 

tttttaaaca tcaattaaag cccttagcta gacactctct gcattacgcc agtttcccct 28020 

taatgtagga tgtcccaatt tgaaattccc cattttctct tgactttgta aaatacaaaa 28080 

cccagagcaa aacattgctt ctttccctct ttacttccta cttgcctaac aatgagacag 28140 

ggacagccgt gcaaatgggg ctttccgatg ataaagtaat tttaacacta actaaaatat 2 8200 

tggtgtttcc tatggtgggc tgctaattac aaaatacatt tttcctccta aagaaaaaaa 2 8260 

ctgggccaag gcaaacagct cagtgatagc aaataaaatg taaccatttc cctatggttt 28320 

tgctgttata tgctattata gacagcatac gtaaagacca gtaagggttc atttttccac 28380 

ctaaaatgtc gggcttcctg taaaatcttt gattctagtt tcagcacttc taaggtaaat 28440 

gggcatcttc acatgtcatt tataaaactt ctaatgaatg aattatatta aaatagataa 28500 

acaacctata gttttaatga atgtatccta gattgtatgc tcatatgtaa ggattctaaa 28560 

tatcaacttg ataaccaaac caaacatagt gcaaataggt tatcatttat taaccacaac 28620 

caccttccac aaaactggtc attttttaat tattaagata atctgcaaca agttggccat 28680 

ttagccatca gcctatttct tcagcattta gacattaatc ccagattcag aaataaagtc 28740 

aagtaactat ttataaccaa gtaacattca aatcaaaact agatgaaaga ttggttagtt 28800 

gcatagctat aaccaaaatg cagttttaat attttactct aatctatatt ttaactgaag 28860 

tcaataaaat tttcactatg gaaatacact agaaaatatg caatttctta ttctttttaa 28920 

gcagatttat ttattgtaca tgttcagtct ttgaaatagg ccaattttat ttatgttatg 28980 

ttatgttatt tatttgtttt gaaatggagc ctcactctgt cgctcaggct ggagggcagt 29040 

ggtgccatct cagctcattg cgtcctctgc tacccgagtt caagcaattc tcatgcctca 29100 

gccacctgag tagctggggt tataggagcg gaccaccatg ctgggctaat ttttgtattt 29160 
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tttgtagaga tgacgtttca ccatgttggc caggctggtc tcgaactcct gacttcaagc 29220 

gatctaccct ccttggcctc ccaaagtgtg gggattacag gtgtgagccg tggcaccagc 29280 

ctgaaatagg ccaattttta aaatgggagt attcctacat taaaatggcc aaataaagac 29340 

tttttctaaa ataaacttta aactaatttt ggataaatat gttttgcctt tgagccttaa 29400 

taaaatgcat taatgaatat taagctgtaa aaagtacatg ttaactacat agctatagtg 29460 

tataatatta atattaatta gtgccttcca gtaaattact agattaaaat aaattttaat 29520 

ataagacact gagctttttg ttttcttgac aatagaactg caagcaatag caaattgctc 29580 

taatcctttc acgtacattt aagaaagttt atgacctatt gaagagaaaa gtagatctag 2 9640 

tgggtgatac tggcttcatt atggttaatt aattgatcag tagaatgtca gaaatgctaa 29700 

gaaaaccaaa gaactacacc agagagaaaa tgtgttaatg taaattttaa ggcaagttaa 2 9760 

ttagcgatat ataataaaga tgtatataag ttcatgattt acctgtttgt ctacaatttt 29820 

agatgatttt ttgatactca tatttaaatc ggtagctttt cctatagatt ttaatttttg 29880 

tttaaattcc tcttcgttaa attaaataaa ataataaaat acacttttta acagttttct 29940 

cttctgcagc tgctctaggt cattggtggc cattgagcca taactagtct atatttgttt 30000 

tgggttttgt ttcatgtgtc tgactcaact aaatttttaa ataatttgta gtaaccaact 30060 

ttgcaaattc tgggtttgtc tttaaatgtc agatctggca acgctgcctt gacatttctg 30120 

cctagaaact attggctcta ggcagtcagt gtctgtctgc ttcagactgt tgactgaaat 3 0180 

ccccattcgt tttcatgccc tatctggccc ttgctggcat atgagtttgc aacctttggt 30240 

gatttgcaga aattgtctat gttagaaaat cattaatatc tagattcaaa catatttcta 30300 

aataaagctt taaattatta tggtaacttt aaatgtattt attctaattt ttttcattaa 30360 

attgctcttc atcatataaa tatataattt ttatacaact ggatgagttt ggcagaagaa 30420 

taccaacttt tcatattctt tgtggcatta aactttaact tgtacacatg gaaataaata 30480 

atccttaaaa tgacttatga ccacataaat gccttagcac atgtggttca tatttggaga 30540 

tttctcatat ttgttcaata taatttattt tgtttgttta tccacagtac ttaagaaaac 30600 

ttctatagtc aacatatata ctgtaactgg cctctacaca gtataagcaa ttaccttaca 30660 

tggctattac cgataaagtt aaagttgtat aaagcctttg gatgcttttg atttcagtgc 30720 

taaataatgg agtacacata gaagaaaaca ttttagcttt ggtttgagtg atcaaatttt 30780 

aggtcagcct ttttacattc atgttatatc atccccatta tgcgtatcct gtgtatttaa 30840 

ttttgatcat ttgatgtcct aaaggaagaa agctataatt ctgcaatttt aattaatttt 30900 

111 



qi ::l ^EE'MHQi „, O B Oi Si O £ 



acactttgct 
gttgttgata 
gcttttcatt 
tagcctttta 
atattcaaat 
ttcacatgtg 
gataagtatt 
agccagagca 
atagggaagg 
atgtgattta 
ttagaaagga 
ttttcaatga 
aaccattttt 
tctttgtttt 
tatattcctc 
ctcttccttc 
attttgatgg 
gtagagattg 
agaaataata 
caggcattac 
taaaataaaa 
aaacccatgc 
ctgaatcaaa 
ttttatagaa 
ctgctatcag 
ttattccatc 
tacttttatc 
atgaaaagtt 



tatccacatg 
tcctcttcct 
ccttcttgtt 
aatacctaat 
aaatgatact 
taaatgggaa 
aaacaacgtt 
gatctatctc 
aaaaactaag 
atctattttc 
actcaaatat 
gttaagcatt 
gttggatgtg 
aacagcatgc 
tcagcacaat 
tttgtttgtt 
tttctctggc 
aggctgtctc 
ctaaacaaac 
ttttttgggg 
ttaactcttc 
agtctacaga 
gaaaacatat 
tccttttgat 
atcaatttaa 
cataataaaa 
tgaggaagag 
ggctaatttt 



ccagagatta 
gtatttttag 
accaattgtg 
ttatgatttc 
agaatcaatt 
ataaacatgc 
ttataccatc 
attttgcatg 
atagtattta 
tagtaatcct 
gtgtgacggc 
atatgtgtgt 
taatttggaa 
agcttgacag 
tgctacctct 
tgcaatgtag 
ccgaacagtt 
tcaaattaaa 
aaacaaacaa 
gcagcaactt 
taacttttgc 
tgtgggcagt 
tttcatcatc 
aaaatctcaa 
atctgaatta 
aataaaataa 
atagaatgat 
ggtgctaaga 



taaaagaaat 
agaggccatt 
aattccttaa 
taaaagatgg 
aagttttaag 
ctttttatta 
tctgtcaatt 
tgatatcata 
atcaatagga 
atcactaaac 
aatggacatc 
taccaaaaaa 
gttttgttta 
aaatacacta 
ctgttgttat 
cacatgacat 
ggtgagatag 
taaattccaa 
aacaaaaaca 
tggtagaatg 
ctattagagt 
taatgttgat 
acaggacaaa 
aattgtttta 
atctaatatc 
aaatttaact 
ctactaatag 
atttacttac 



ccctaaactt 
tcttattttc 
aatagagatg 
tatagcttaa 
caaacattca 
aaaataattt 
ggaagttgtc 
gcaaaagtct 
ttcagaggaa 
tgtcattgaa 
ttgtcacctt 
ttatttttta 
cattatgtcc 
cccaaaaagg 
gtaaactttg 
tgaggacgaa 
ccccttaggt 
tgtgaatatc 
aacaaacaaa 
cagaactcac 
catatgcatg 
aggttgaagg 
tgctgtaatt 
atttctattt 
atttaataat 
tatggccatc 
aggtataaca 
aaaaagaaaa 



gtcccactta 
tctagacata 
ataaaattta 
tttcattaaa 
tatatctttc 
gaagacaaaa 
actctaactt 
aatcagttgc 
aattatgcta 
ttgtactgca 
tagttggcct 
tagttcagag 
ttaggggttt 
gactaaggta 
tgtgctgtct 
atcactttta 
agagatacta 
actattttga 
aaacttgtcc 
ttcaacaaat 
caaatattca 
atgctacaat 
aaggtgtgat 
tgcaggggta 
ctcaaaataa 
ttttactgtg 
ctgtatgtgt 
agaatatact 



30960 
31020 
31080 
31140 
31200 
31260 
31320 
31380 
31440 
31500 
31560 
31620 
31680 
31740 
31800 
31860 
31920 
31980 
32040 
32100 
32160 
32220 
32280 
32340 
32400 
32460 
32520 
32580 
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tagtttggtg aaacactgaa taatggcgaa actaggtctt tctccattat tttttttctc 32640 

tccaattttt cagcaatagc aaatagctgg caattattcc atgttaatat tttgatccag 32700 

aaatttatgt tccagtaaag cgagcacatc tccctcctta tttttgtaat ctaggcatga 32760 

tgtcaagtgg cagtttaaca aaagaactgt ttttccttta aaaaaaaaaa aaaaacaaaa 32820 

gctgccaata tgtattccat ttccctatgc cttctgtgac catccttcat ttcccttggc 32880 

cctggcccac cactgtcctc catttgtagt ccatgttttc accctcttta catcctttct 32940 

tgccctgtgc ttttgagttc tcaattaact tggctgtctg ctcattgctt atgatttcca 33000 

actgcatatc tgatagaagc ataattttct cctcaaaacc ctttatctta ttttttttcc 33060 

ctatgtgatt caaacagatg gcgtaagatc atctggaaga actgagcaat tataattaga 33120 

ttcaatctgt ttgaaattgt tcattctgaa tagtaacctc ctctgaattg ttttcctgtc 33180 

ctggcattgc cttgcccttg tagatgtgct taagtgtcat agctgtgctg ttttgcagat 33240 

ataccccatc atctcggcag gccaaccaag aagagggcaa agagtggttg cgttctcatt 33300 

ctactggagg gcttcaggac actggcaacc agtcacctct ggtttcccct tctgccatgt 33360 

catcttctgc agctggaaaa taccactttt ctaacttggg taaaatattc taaaatattg 33420 

attttgtttt gtttctttca ccacccactc tcacagaaac cctggaatct ctccataaca 33480 

caacacgttt tcatttaaag ggagggataa aagcacttta acagtacctt tcatttgtgt 33540 
cattgtttac tcttcacaga aaaatctcca aacattatgc tatttattgc tcatgacaaa 33600 
tgcttaacat agattaatac tgtggttgtt ttctagtcta ggctccagag gctcagaaag 33660 
ttcacttgac ttgaaaaagt cttaccatta ctaagggttc aaggcagtaa ccagttcaga 33720 
acatctgact ttaatcccag gggcctttcc attccattta agaatcctct taaaaaacag 33780 
gaaggcatct ccttatttat ttgtctgaaa tattaaaaca tccttaaaac aaaattagta 33840 
atcttttgta gaaaatagaa acaattagga agaaaaaaat atgtaattcc atgactcaaa 33900 
gttaacttct tttaacactg ttaaagttaa aactccttaa aattcataca agaatttctg 33960 
ttaagacaat actctgaaca ttttcaaata gatacaatga aaaataaatt accaacttag 34020 
tcattgggtt actttgtatt taacatcatt tgtatgaaat ataaaatcat ttgcataaaa 
tttcattaaa agcactctga gtaacaaaat aattaaagaa aactaaacat gccagatacc 
atttaataga ttcaatgact ttaaaaatat atttattttc tataaagtca catataaagt 
attttcatta tttttatggt aaatattttt attattagtt tatcagaaaa acttgtacat 
aaagatgagt attgatacat aatcttatta gagccagaga cgatcattcc ttctagaaaa 
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acacatctct gaatttagga cggaggacaa tgaaacaaga aatttcactt tataatttac 34380 

ctttgtcaaa ctatcccaga gcacatcaat tccatcatga aagtactctt ttgacattat 34440 

ataaaaaatt agtaatagaa aacacacaat ccaaaacctt atattttcta aacttcaagt 34500 

taatcatcaa cttctcttag atttttgaag acctgaaaat aaacataatt tcaaataaca 34560 

gaactcaaac accatataca tttgtaatga ggcacaacag tcaattttga gccttgtatt 34620 

ttccaggttt tagctgaata atcttcactg ctttcttagc tttttgccag tctagtttgg 34680 

ggactatttt gccttactgg gcctaaacag agtgtaatat taaaatatgt taataagcca 34740 

tactgagaat aagataaatg caggtttcta actccttagg gacacaagtg gggacaacac 34800 

attccatgaa cacaggtgaa tgaatgcccc tagtttctct gagttggaca atttcatgcg 34860 

atcatttttt tctctgaggc caaagtctct ggtttgatct tcctagcagc ttccagaaca 34920 

gaaagtgagt ttactttgtc tccatattct ttttctccat gctcgggaat cccctgcttt 34980 

cctgatccca ccacaaaaac tcccctgagg atgaagcctt ggctttccag gcttccaggg 35040 

aagcctcgat tcctggctgg aggtagttgt accacactcc cagagggcta aatcccataa 35100 

acatcatctt ctgtctttgt agatcataga actttttatt atcatccagg aagatttctc 35160 

ttttgaaaca aggctggaaa aactttatgt cagtcctgac ctgctcttta atgactgcgt 35220 

agagggagat gcccagctta tccaacttgg gttgcagaga ggacagatct gcagcccctc 352 80 

ttgccagaga aaacatcctg gcacagccac aatcacaact ccattcttct cccgatagct 35340 

cctttgcttt gaaactcatt ggttacttct ccagtgtttt caggtctata ttctccaggt 35400 

actccagcac ctctttccag ggcttggaca aaaatacatc tgtgttggcc agcatcagtg 35460 

ccaaggcagc agcctccaag ggctcctgca cccatggacc acatccacac agagaagcac 35520 

cttgggtcct caagtgcctc cctcttcttc ccttctccca aacctgaagc ccagacacta 35580 

aggggtcaaa ccctcctggg ccctgagggt tccaagggcc tcattacttt ttcttttttt 35640 

cactggaaaa aaaattctaa tcatgcacct acagaagatt gacatttttc agtaagttgg 35700 

actttccagc tttcagccag gacaagactc aaggctatgt cttttctatt gcaacccttc 35760 

ccactatatt gagtagggct tttagcaatt gaaaacaatt attttggtca tggtttcata 35820 

taagctaatg atttcatatc aaacaccaag tttttgtttc ctaacctata tagtgataag 35880 

agaatttacc tataatgcca aagaatgtat agcttttatt tgctttaaga tgcagttgat 35940 

tttttaaaaa agcgaaaagc ctaacacttt aacttcaaaa aatgaattta aaatgtttgt 36000 
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gtaggtcata 
aagtgctata 
taatttgatt 
caccttcatt 
aagtcgttta 
gtaaagaatg 
ctgccaaatc 
cagatatgcc 
aggattttat 
caaaactatg 
gataagttta 
tccatggtca 
attacatagt 
tgacaacaaa 
atacatccct 
taggatgtgg 
tcagacaggc 
acaattcttt 
aggtcacttt 
ccctgggcaa 
ttttcctgta 
attctaatct 
taaaatccca 
tcttttcttc 
atctgtgtaa 
acaggaacct 
cattgtcctt 
tactctgtcg 
attcctgtcc 



ggaatatgaa 
agcttgctaa 
agttgactcc 
attccttagg 
aacttaattt 
acggatatag 
attattttca 
tgagctttgt 
gtaaaagaca 
acattgttat 
tgctaagatt 
caatgtgtta 
tttcagagtt 
gctctctgta 
tacatttcat 
gtttcccccc 
catgacttaa 
ctcctggggt 
tgagagtgaa 
tgcaattcat 
gacttttttt 
ctcaaatcaa 
tcagtgtgct 
tatttgtctt 
tcaatagtct 
gtgattttat 
gctgaagctt 
actcccttat 
gtgtcaacat 



aaaattttat 
gtacttcatg 
ttcttctatt 
aacaaaaaga 
tcttatctct 
tgtatgtaaa 
ctggaaagca 
ctacatttaa 
ttcacagatt 
caaggactca 
tgcataaatt 
aaaaatgaat 
aaaacacaat 
gagctttaat 
tgctatagaa 
ttaatctatt 
gtgacactgc 
tgctcatctt 
tgggactata 
cctccatctc 
tatcctaaac 
catttcactt 
agaccatata 
ataatttcat 
tacttgaaac 
cttcagaata 
ttttaactat 
gtttcaacat 
aatctcatgt 



acaacatcta 
tctcctatca 
tttcctcacc 
ctcaccactt 
tgaatggata 
gatggagaag 
tgtcttacac 
ttgagtagta 
gtgtttttga 
tttaccacaa 
aaagttaaca 
ggttctgtag 
taataaatga 
gttctaatga 
accaagtctg 
gtggtttata 
cctcttgatt 
aacatagctg 
ttaataattg 
ctccttatat 
cctttttcta 
tctgtctgag 
gccacctgaa 
gtatcatcct 
tgtgctcttc 
tctcctacat 
agacaattgc 
tttcacccat 
acctctccag 



aaacacaccc 
attctttcat 
attattattc 
aactatgtct 
cataatacct 
tgtgtaagac 
gatcatagag 
attcgcaaca 
aagattgtat 
atatcaaata 
tggcaactga 
cacacttggg 
aatgtgaatt 
attagaaaac 
aaaggttaag 
tcagagatct 
ctcttcatac 
tatcatttat 
ttccaggtat 
ttatgtgttt 
tgttctcatt 
acctttttca 
atcaaagtct 
tctctctact 
atattgtaca 
ctgtctctca 
agcagatttt 
tggaaggtat 
atcttagaaa 



aaatcaccta 
taattgacgt 
tgattaaatc 
gacattggtg 
aggttatatt 
ttgacagatt 
tagcattcat 
cagtaaccac 
ttttgaagta 
tttgtgcaaa 
agctaacatg 
aatgtatttt 
atacttttac 
cactgatcaa 
tttacctttc 
ctcagctgtg 
ttttccaact 
tgtagacaca 
taggtgcaaa 
accaagttgt 
cacaacttta 
gctctaaaac 
tttcttaagt 
ctagcacaaa 
ttttcaatag 
ttttcaggga 
aaactgatct 
aaaagaagat 
cacgtatggc 



36060 
36120 
36180 
36240 
36300 
36360 
36420 
36480 
36540 
36600 
36660 
36720 
36780 
36840 
36900 
36960 
37020 
37080 
37140 
37200 
37260 
37320 
37380 
37440 
37500 
37560 
37620 
37680 
37740 
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ttcaaatcag 
attcaaaaac 
tcattgggca 
ttaaatttta 
catcacccat 
ttattaatgt 
gtgttaatat 
attaagatca 
ctatggttcg 
aagactaaca 
aaaataaccc 
ctctaagatc 
agaggtcata 
tgcatgataa 
cattaatgtc 
ttttgcatat 
caaatcttcc 
aatgaactcc 
atctggttca 
aacccaccaa 
aagagatcaa 
attctacaaa 
taataaactc 
atttaaaaac 
ggaggctggg 
tggcactgca 
taataatatc 
acaaataaat 



gcatttggag 
ataatattta 
agaaaatgag 
aatatatata 
ttttgatttg 
tcaaaagttg 
ataaaatttt 
ctacaatatg 
tgtgtcgttc 
aagtatagct 
aatcgctacc 
cattcaaagc 
gttcttgtga 
caatcgctca 
aacctgtcac 
ggtgattgtt 
tctaccttga 
tgtcttctat 
acccttcatc 
tgacagaaac 
ataaaatttt 
ataagtcaaa 
actagttaat 
ccagtccagc 
gcaagaggat 
tactccagcc 
ataaaaaccc 
atataatctt 



atctttatgc 
aagagttttt 
tactcttaaa 
tatatactta 
accattatct 
cagtattcat 
tagaaatttt 
tcatatttag 
ccagcatgct 
tacataaatc 
cttcaatttt 
caggagacac 
gccatctcat 
gattcatttt 
ttctaatatc 
aaacaccttt 
atcctttccc 
tctttcggaa 
tcttatgtaa 
tactaaaatc 
cctgaatctt 
gttagattgc 
gtctagctgt 
cagacacatt 
cccttgtcca 
tggaagacag 
agtccacatt 
aataaaaggt 



tgtatggttt 
attgtattta 
atgcaataat 
ccttaaatat 
tttctgaata 
gttttcttta 
actataaaaa 
tagactactg 
agccctagta 
aaaaagtctt 
ttatgaatga 
acaagaattt 
aacctggtga 
tcatcttgcc 
catctaatat 
gcataggaac 
atcttcgtgt 
gcatagaatc 
aattttatga 
tagaaataac 
cacctattgt 
atatgacagc 
agatgcaaaa 
ggatcatgcc 
ggagttacag 
agtgagaccc 
tatataggat 
tagtggcttt 



cagagtggaa 
cagttcacct 
aaattaaagt 
gtcctcttgt 
atcagtaaga 
ttctttctac 
atcacaacat 
tgagctactg 
gaaaccattc 
tggatgaaac 
aaaaatggaa 
ctaaatagaa 
gactcattgt 
acaagggtta 
tctctaaatt 
agtttctatg 
tcaaccttca 
tcacggtcag 
catctctagc 
acctttgaaa 
tcctagttat 
tcttcatatt 
gtagagagtg 
tgtaatacca 
gctacagtga 
tgtctcacaa 
cctgttttcc 
gccaagatag 



aaaatgattg 
gaacctctgt 
tactttatta 
tgtcttttag 
tacaggatta 
caattaaaat 
atattagaaa 
ccacagtaaa 
ccattcaaga 
ttcatttggg 
gaataaaggc 
gagaaacaga 
catgcctcca 
catgcaggaa 
cgatggatcc 
cttttgtact 
atcttctcag 
aagagaccac 
ttcttcttta 
ttctttcttt 
atatatccag 
taaaacaata 
acttggggtt 
gcagcactca 
gctatgatcg 
taatagtatt 
tcaagttact 
tggcttggct 



37800 
37860 
37920 
37980 
38040 
38100 
38160 
38220 
38280 
38340 
38400 
38460 
38520 
38580 
38640 
38700 
38760 
38820 
38880 
38940 
39000 
39060 
39120 
39180 
39240 
39300 
39360 
39420 
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atgcaaatgc aatttaagac aaagttggta gccctctttt tcctaataca ttgccatatc 39480 

tgtttctctt ctatttggaa attcttgtgt gtctcttggc ttcgaatgga tcttatagtc 39540 

cttttattct tccatttttt agtcataaaa aaactgaagg gtagtgattg ggttatttgc 39600 

ccaaagcaga tggaaagcaa aactaccact agaagctctt taccaatttg tgttccattc 39660 

aaaaaattat ctttgtatgt cttacatttg tcttctactg tatagttttt cttgttctat 39720 

tttacatatt aacttttctc cttcttcaga catctgccct actggctact cttgaaatca 39780 

gagactgtgt catatttttc cttctattca actacaacat ctaaaagcag atctgtcata 39840 

gttattaact taattgaaca ctcttaaata gttaggtgta atttccaatg cagaagctat 39900 

caaaagggtt tgtaaatgca aactattccc tttaaaatct atcctaatcc tcattaatgt 39960 

ttcatcttga tagagctaag tattatgtat tgaaattgta gaagtacact tcacttggat 40020 

atctctgcaa tcatttaggt aagaattata caaagccaaa aagcaaataa aatatcctcc 40080 

taaccctata gatacgtata ctaaaatgat gcacttgcaa atttgtttaa tacttcatta 40140 

atttaaacaa gagtaaattc atactgtgaa ccaagaatag ggtgacttac cccaatcttg 40200 

ccaccttaaa cataaacatt ttaagtcttc aatgtcctac agtgtaccta ctggctgttg 40260 

tcactaatca gaccgaaatg gtactaatgg tcactgcagg ctgaaggaat atgcttgaaa 40320 

gataggcaga tcctctccct ctcccttttt tacttttttc gcctttccat cctttcttct 40380 

ttttttccaa tagattgtgc actttggaga ttcatatttt cttccttttc cattacattt 40440 

taaatatgtg attcttagtc ctatgcttcc ttttactcca atcaataact ggctctatca 40500 

gagggttgtt ctgtgtgtta attcggttaa taccaggatt atcaagcaca gtgccttcca 40560 

aatgtgagat acttctctcc ggttacctct gggtttactt ttcctgtttt acattgtttt 40620 

gagagccagt acttgtatta agaagaagtt tagtgcctgt gtcacagaaa aaatcttagt 40680 

aaattttgaa gtgatgtcag aacaactcta agccactgac ggattccaca gggttttgaa 40740 

aatactcgtt agttcccttt atatcttaag aggctcctgc ctgctttctc atataccagt 40800 

aacaaacttg cttttcttaa atatgagcat ttagaatatc tttctcaatt tttctgtttt 40860 

gcttttattc caaatttcac aactatattg ttttccaatg tagttgtaca tacaatcaac 40920 

caaatctttc cttaaattga tgactaccag gtgaggactc tttggcaata agcaataaga 40980 

aaataaattg ttattaaaaa ttacagactt aagatacttc tttggaaata taacatgttt 41040 

gtgacttttg accatctcat catgatatgc tcatcttaaa eagagtagaa aatcatttca 41100 

tataattaac tttatggtgg gctgcagata ccatgtatgt tacattgtgt ttagttataa 41160 
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aaatgtttat 
catgaactta 
aagcccttct 
ccattaaagt 
cttcataaca 
gacttgttgg 
ccagaggaaa 
ccttaaatag 
tttctatcaa 
gtttttcagt 
gctcaaacag 
gtgggagtgc 
gagacagcat 
accctgagag 
actagtgtaa 
tctatttctt 
gaagagttta 
aggtgcatca 
acacacaaac 
ggaattccct 
tttttgcttt 
ataaatatct 
cctcaggaag 
tgtcaaattt 
tccataaaga 
actagacata 
aaagaaaagt 
ttgtggtata 



tatacactat 
catcaattaa 
tgggaaattt 
gattactatt 
ggtagcagaa 
ttttgtgtac 
agagggacgc 
ggactcacta 
cctgctattt 
gagcccaaca 
catcccagcc 
cacttctctg 
ggaagaaggt 
ggaaaccatc 
ttatcatttg 
tatgtatttg 
gactaaatct 
aatatcagag 
cagaaacatt 
ctggtgctaa 
ataagtcact 
ttatagttat 
ctgagttgat 
aacaataatt 
aaacttacag 
caaattcaaa 
ctccaaatct 
aacttaggca 



ttccttataa 
gagcttgaag 
agatgtttta 
tttaaagaga 
aacaccatgt 
tgttgcttta 
aatgggatct 
tacattcatg 
tcatctttca 
aatttgtctc 
caagactctt 
gaggaaagac 
aagcgttgag 
gtgtcactca 
ggaaactaga 
gaaatatgcc 
acttttccaa 
caaatgtttt 
attttatgaa 
ttttttatta 
atttcttcat 
atagttctat 
gctcaaatat 
tgagagactt 
catacatatt 
ctgattagaa 
tgagacatac 
agtttcttaa 



tctaactttg 
tgactgagag 
tattttactt 
aaaccgaaaa 
tattacattt 
gaaattgatg 
gttattgaac 
ttttcataag 
agcttaagag 
aatttaacct 
ccttcgatct 
ctcgtgccat 
ggggattaaa 
catcacaaag 
agcttgaaga 
agaatttgtt 
tacagaaata 
gtttgacatt 
aataccacat 
tatcatttta 
ccaatggcaa 
gtaaatactc 
atatattttt 
ttctctttga 
ataaagcatg 
tgaaattatt 
cagagtttaa 
ccttcttatc 



ataataatga 
tatttgcctg 
tctttttgat 
ctctagaaag 
ctagcaagag 
taaggcttcc 
atttcagagg 
tattgggatc 
taataggctc 
tcccgggccc 
ctatgatgac 
cagtcattcg 
gatgaagtca 
attcctgaag 
agttttattc 
tatattaata 
tacatataaa 
ttggttaaag 
gttgctgact 
gaattcatat 
taaaattgtc 
caaataaatc 
gtaaactgta 
tttaatgaat 
tcagctaagg 
aaccctaata 
gtcttcagcc 
cctaagttct 



tggtcctaat 
gaagcattta 
tttgcttttt 
accatctttt 
cagtagaggt 
cataaacgtg 
cagactctta 
atgttcttac 
tgtgtgtttt 
agcatgatgc 
tcccagcttt 
ggctcattca 
ctttatttaa 
aggaaaataa 
tgtattatct 
cttggctgta 
ctattttccc 
agccataaag 
tttattccca 
tgtacctact 
acctaaccta 
agcttgaaaa 
gaagctcaaa 
ttttttagta 
ataaaataaa 
attatgttta 
atccatttac 
gcatctgtaa 



41220 
41280 
41340 
41400 
41460 
41520 
41580 
41640 
41700 
41760 
41820 
41880 
41940 
42000 
42060 
42120 
42180 
42240 
42300 
42360 
42420 
42480 
42540 
42600 
42660 
42720 
42780 
42840 
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cttcttaggt 


ttgtcacaag 


gatgaaatat 


gagaacaaag 


aataattctg 


ttccatgatc 


42900 


ttttcccttc 


ctaccttctt 


atttaaagta 


tcttctgact 


gaggggttag 


gcagcaatga 


42960 


aaattgactc 


atgtttttca 


ggtcaccact 


atggattcaa 


tatactggca 


ttaaatcagt 


43020 


agagaatagt 


tgtcattgcc 


ttttgcaata 


ttaaccaaac 


cactcagttc 


actgtgacag 


43080 


acagtgaatt 


atatccaatg 


actccactga 


ttttttccat 


gtagatagac 


aaaatataac 


43140 


tactctcaaa 


tgtaaggacc 


ctgctttctg 


aaatggttct 


gttgctctct 


tcacagatag 


43200 


gcttcttata 


atacttttaa 


aataatttgc 


taagcataca 


gatggctttc 


tagagtgtgg 


43260 


cattgacaaa 


taaagtgatt 


tttatatact 


gggaaattct 


ggccttcaat 


gtatcaggat 


43320 


taaataatct 


gaatttctga 


aagctagcct 


aagtgggcaa 


gatggctttt 


ttgtgctcac 


43380 


gcattgaata 


ctgaactatt 


ctagttctta 


aatggcgatc 


tagattcaag 


acttattgaa 


43440 


ctagattgaa 


gggactttat 


tgatatccta 


cctaatgctc 


acactgacag 


atgaagagac 


43500 


tgagccacat 


gttctaaggt 


cataaacaga 


aagaatgaga 


atgagatggt 


ctaattaatt 


43560 


gtccaccttt 


cctatggtac 


atcagggtaa 


cactttagtt 


tacgagggta 


ttattagaga 


43620 


tagaaagaat 


ttttttttaa 


ataattgact 


caaataccaa 


cattttgcac 


attacataga 


43680 


gtaatagctt 


tgcccaagtt 


agaaaactgg 


gggttcttct 


ttattcctct 


tttgaccaca 


43740 


tctatatact 


cagttttaaa 


aaggttcttc 


ctggtatcct 


tcaattccat 


ccccatgttt 


43800 


tcatctacaa 


gcctagtgca 


gctattccag 


ccgtctcctg 


atcaggtctt 


aagcacctcc 


43860 


catatgtcct 


tgtagtaccc 


accatattga 


tctcagtagc 


aatcacagta 


ctctattgta 


43920 


aatatctttt 


aaattattat 


cttctctttg 


agcttttggg 


attttatctt 


atttattttt 


43980 


gtagttccag 


gatctagcaa 


cagcttgtca 


catcgttcat 


actcaactaa 


tgtttgttta 


44040 


atgcacaatg 


agcagaaata 


aacatactac 


tccatagtaa 


aaagaggatg 


aacttttctg 


44100 


caaatattaa 


tcagcaccat 


tttatccacc 


ttttgggttt 


agtacattgg 


aagtatagga 


44160 


gtataaagca 


gaatgtccaa 


tgtttacagt 


gatattttga 


aatagataaa 


agccagtgcg 


44220 


acatttccat 


tctcaatttc 


tctgagacat 


caccttgaaa 


aaaaaaagta 


tttttctctt 


44280 


cc taaaatt a 


gtaaaggaac 


agtaattcca 


catttataag 


agtatgatca 


acgcatcaca 


44340 


gataatgttg 


taataacaca 


ttagataaaa 


gtgcttattt 


tcctgaaatt 


atatggagaa 


44400 


aaaaatctga 


aagtggacct 


ttgttggata 


caaatgaaat 


aaataaggta 


catacatttt 


44460 


ttaaggttcg 


aaagtttatg 


gcaactttag 


tttgggtttc 


catgctattc 


tatttattat 


44520 


atgggaattt 


actgtagctt 


tcaacatgta 


cgaaacaggc 


tggtagggct 


catgcttgta 


44580 
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ggcttctgtc 
atgtctcaac 
tagttgttat 
atttttgagt 
gctcttcatt 
gttaagaaaa 
tgaacagatc 
tctaagctgc 
tacatgtgct 
cgatttcata 
ctatactatc 
taattgttct 
atcgaagggt 
tttttaaata 
atttatttat 
atgacacaat 
cagcctgtca 
tgttttagag 
caattcctcc 
agatagaaac 
gtgacattat 
cttttttaaa 
ggttaacagt 
tgaataagcc 
tgatcaactg 
tgtttgaacc 
aaatctgact 
ttacccatag 



taataacttg 
gtcctgactg 
tacttaattt 
agaccttatt 
atcactggtg 
agcttgtgct 
acattcatct 
ccaataccca 
ttcacttggc 
aaacctactg 
attttagtaa 
atagtatttt 
aaaagcagta 
taccttttga 
ttatttattt 
catagctcac 
agcagctagg 
acaaggtcat 
tgctttagcc 
tttatttatt 
aattcatttt 
aaagtagcct 
gaggtatatt 
ttaatcctgt 
agcatcatgg 
ttctctttat 
gttattggca 
cttctagaaa 



gcaactgagg 
ccaaaaaatt 
ccctaaccac 
gttttaactc 
tccagcactt 
tttgccatgc 
atgacttgca 
ataaagtgcc 
ttttgtgcac 
aggtatgact 
tgacgagttg 
cttctcattt 
agggtaaagg 
aagagtgtga 
ttgagtcgag 
tgcaaccttg 
actacaggca 
tgctatattg 
tccccaagtg 
aagagaaaaa 
tgacaaggag 
acaggaagtt 
taaaagcagc 
actgtaaggt 
gcagtatttt 
agattaatct 
gaagtaatat 
ttaagaaatc 



tactttaggg 
atagcaggct 
ccgttcttta 
tattgttctg 
cttctcttta 
acacagatga 
caggagttgt 
aggtgctcca 
tcatcataat 
tggagtctct 
tccatgtttt 
tttattttta 
gtaaaagcat 
tttttttaac 
gtcttgcttt 
aactcctggg 
cgcaccacca 
accagactga 
ctgggattac 
taccagtgtt 
agtttttctg 
atattttatg 
ctactgaatc 
tcatgaagag 
tttcactcat 
catatattta 
ttttctaatc 
acgattagtt 



agtatggatg 
ggttctcaga 
ctttttctgt 
tttgttttct 
ctctacagta 
tgaaatagat 
gtagcaaaat 
cctgccattc 
caatgagtgg 
gaaaccatgt 
gttctttgag 
agtttattgt 
aaaagaacca 
ttttattttt 
gtcacccagg 
ctcaagttat 
tgcccagcta 
tcaataccca 
aggtgtaagc 
tcaagttctt 
tttggtaaat 
agtgagtctt 
tcaatgggac 
ttcatagcct 
tatcattagg 
ctgccttaca 
tctcctttca 
tttaggtaaa 



gggctcttcc 
atcttatagt 
aaaggctgga 
ccagttcatg 
agtaatggct 
cattttactg 
aacggcatac 
tttggtcact 
atgtagaatt 
atgtagtctg 
ccgtgactgt 
tgagaggatt 
gagatgtttt 
attttatttt 
ctggagtaca 
ccttctgcct 
atttttaaat 
tggcttcaag 
cagcacactt 
ttgcaaacgt 
acaattctat 
tttagagcta 
ttgagtacta 
ctgctgtcac 
ttcaaatgtt 
tagtcattca 
atgattaaaa 
tgtacttttt 



44640 
44700 
44760 
44820 
44880 
44940 
45000 
45060 
45120 
45180 
45240 
45300 
45360 
45420 
45480 
45540 
45600 
45660 
45720 
45780 
45840 
45900 
45960 
46020 
46080 
46140 
46200 
46260 
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gtgcaaatgg 
atattaagac 
tctgatttaa 
ttattctaaa 
aaaactaaat 
tagtgattgg 
atagctagaa 
gaagaatggg 
accacatgca 
gaaaaattta 
ccagtgtcca 
tagtaactat 
attgactttt 
agattggata 
tgtcaaaatg 
tatgaaatgg 
tatatacata 
tataggccaa 
ggtgatcgag 
atgaaaatga 
tatttgggga 
ccgccatgat 
ttaaactatc 
agagatttag 
ggcaagttgc 
gatctaaccc 
atctgcagag 
gccaagcgcg 
cacttgaggt 



ataaagtgag 
ttagtgttcc 
accattgcca 
tacatacaat 
taaaaagaaa 
tagcacaata 
aagaagattt 
atattccact 
ccccataaat 
ttcctcaaga 
gtagaccttt 
gaaccatatg 
ttattttaat 
gtgcaagttt 
atgaggataa 
gaacacagat 
tgtatgtgta 
tatatggagg 
ccacatgaga 
tatatttgaa 
actacttaag 
caggccttag 
acatgaaaaa 
ttgatttgca 
caaatttctt 
atttgtctcc 
gtgttacctt 
gtggctcatg 
caggagttcg 



gaatgtgtaa 
tcctgttggg 
tttacatcta 
aattaatatt 
gaaagaaaga 
gagtgactat 
ggaatgttcc 
ttcccagatt 
acatacaact 
aaaggatcat 
ctgtgttaat 
ttgccattga 
taatttacat 
atagagaaca 
ttagaaagca 
taaaatagag 
tatatataca 
tagggtatat 
aggtgatatt 
ttgaaaagta 
agaaaaatca 
gatttaattg 
gcaatttatt 
tttttgcctg 
tggactatac 
ggatgtctgc 
tggcaaactt 
ccggtaatct 
agaccagcct 



acacacatga 
ccagcactgc 
tgtgtaacat 
tggataaagc 
aaaatacact 
agttaacaat 
taacaggaag 
tgatcgttac 
attgtgtatc 
gagtttaaga 
aaaagtgttc 
ttatttgaag 
tgaaatagcc 
caaggggtac 
tacatgagaa 
tatgtatata 
tatatatgtg 
cctagtgtta 
attagaaaat 
aactgtagta 
taaaacatga 
gcaatgagaa 
atttttcatg 
gttcaatcaa 
ctataaaata 
aaagtggttg 
atggtttttg 
caacactttg 
ggccaacatg 



aaaaaacaca 
catttgttgg 
caaaagatgt 
taccttcatg 
tagatagaag 
aatttattgt 
aaatgatatt 
acagcatatg 
ccaatattaa 
aaaaacagat 
tgtatctaca 
tatatctggc 
acatgtgcct 
atttgtagat 
atattgtttt 
tatacatata 
tgtgtgtata 
agtgagtaaa 
tgaaagttgt 
aaataattca 
ggagtcattc 
aatacctatg 
ccttcttaat 
gaaattatcg 
aaatttgaaa 
gaaatcacaa 
tgtttgtttt 
ggaggctgag 
gcaaaacccc 



taaaagaaat 
ggaattgtat 
agcatcatta 
aaacctaaga 
aaataaggtc 
acatttcaaa 
cttcctaaat 
tttgtataat 
agattttttt 
tactagtcta 
ctatctaata 
aaagagatga 
agcagctact 
aggagtggga 
aagagtagaa 
tatgtgtata 
tatatatatt 
gaatggatta 
atttgagatg 
aataaatgaa 
tttccccagt 
aaaatgcttt 
aactctcaat 
cgtgacatca 
atattagcta 
gcctaacctg 
gaaatctaag 
gcgggtggat 
gtctctacta 



46320 
46380 
46440 
46500 
46560 
46620 
46680 
46740 
46800 
46860 
46920 
46980 
47040 
47100 
47160 
47220 
47280 
47340 
47400 
47460 
47520 
47580 
47640 
47700 
47760 
47820 
47880 
47940 
48000 
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aaaatacaga 


aattagcccg 


gtgtagtggc 


atacgtctgt 


aatcccagct 


atttgggagg 


48060 


ctgaggcagg 


agaatcgcct 


gaacctggga 


ggctgaggct 


gctgcagtga 


gcgccactgc 


48120 


actccagcct 


gggcgacaaa 


gccaaacact 


gtctcagaaa 


aaaaaaaaaa 


aaaaggaaaa 


48180 


gagggagagg 


ggagggagag 


ggagagggaa 


tctaagccaa 


cactgtgaaa 


tattgtgaaa 


4 824 0 


tatggagctt 


ctacctaaaa 


attcaaaatt 


ttaaattcct 


tttaaaaata 


attggaatat 


4 83 00 


ctatggaata 


tctagcaata 


ctaagatgaa 


attcctctgg 


gttttcagtc 


acctgtaatt 


4 8360 


gacaccttta 


gatgttggca 


tgggctctca 


ggaagccaca 


gcctccacca 


atgcttttct 


48420 


tcctgacact 


gaagctaaat 


ttgggtggct 


agttttcatt 


gtgctgttgc 


tttcctcatg 


4 8480 


ggaaagaaat 


accctttgct 


atttatattg 


ctgtcaaatg 


ggaaaatgaa 


agacagccaa 


48540 


ggaagatcat 


gtgactattt 


aaatacttca 


agtccattta 


ttctttatta 


gccttgtcct 


48600 


gttaggcatt 


taaatttttg 


atccctgcaa 


tagatgtttt 


ttgattaact 


gtatattaaa 


48660 


aactatattt 


aacctgtttt 


gaatttgaat 


tctaaattgt 


attttttcat 


gagagcaagt 


48720 


gtcatttttg 


attcattgtg 


gattgtttaa 


catgttgcct 


aacaaatagc 


taatactaac 


48780 


gtcataactt 


tttaattagt 


aaatttgaat 


ggataaatgg 


ccacttattg 


gcttatagaa 


48840 


taaataaaaa 


catttttatt 


cagtcaagtg 


tttcatattt 


tttatcatct 


ccaggacatt 


4 8900 


gggcttgctc 


aaaaccattg 


ttaaaaaaaa 


aatggcaaat 


aatccagttc 


catcatgata 


4 8960 


tcattaatcc 


cacacctaag 


ctactgaaaa 


aaatatatta 


atattctggc 


tcattgcttt 


4 902 0 


atttttatgg 


taacacccac 


ctggtattaa 


taaccacaga 


gtacgaaaga 


aggcaaaggt 


4 9080 


taaagcaaat 


aatagttttg 


aaaaattggt 


agtgaaaaaa 


gtcatgctat 


acggtatgta 


4 9140 


tataatagat 


atttaatgat 


tatgcttgct 


actagtatat 


gtaacaggac 


tattatagat 


4 92 0 0 


taacaaaaat 


gcggtgagta 


tatttcttga 


ttatttttta 


aaagaataaa 


ttattattta 


4 92 60 


aaaatacatg 


aattatttat 


tgattcttga 


atctttacca 


gctttctata 


attctaggaa 


4 932 0 


gcctagaagc 


agaattgggc 


aggataaact 


ggcaaaaaat 


gtaaaaagta 


ggccgggcac 


493 80 


ggtgggctac 


agtgagtcgt 


gaatgcgcag 


tgcacctgag 


tgatagatca 


agatcctgtc 


49440 


tcaaaaaaaa 


aaaaaaaaaa 


aaaagaaaga 


aagaaagaaa 


aacaacaaca 


aaaacaaaag 


49500 


caaagtacta 


gggaaaacta 


atagacatag 


ttacatagtt 


aattgtgcca 


tatgttttaa 


49560 


ggcaatgaaa 


cttttatctt 


aatattcctt 


gcttactttt 


tattcaaaaa 


ccaaactgtg 


49620 


tataaaacct 


taaaattatt 


aggatctaaa 


aaataaaatc 


tttccttaaa 


aatctaaaat 


49680 
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tgagatgtaa 
ttctaccact 
aggtagtgaa 
atatcctgat 
gttcaggtag 
gagattgttt 
acattatgca 
ataactcatt 
ttctggaagt 
ctgcatagcc 
gtttaagcgt 
aagtagaagt 
agctttattt 
cttggctaga 
cttcaatatt 
ggtttatgtt 
aatattatgg 
catttatata 
tttcattcta 
tatacttgtg 
tacaggctga 
tctattttag 
acaatagcag 
atattgcctc 
ctagattgag 
tcattgataa 
gtaagaccat 
atactagcat 
tatgtgtggt 



attattcaag 
tagccacttt 
aatataggga 
ggttagacta 
ttttgtgtca 
ttcagctttg 
ttcttcaact 
ttctggcatt 
caagtcaaaa 
aaaaatgttt 
tttgctgact 
aaataatgtt 
gacattgttt 
ggtctctata 
ttaagtttac 
gcttatgatt 
atcaaatgac 
aaattctaaa 
ttcttctcat 
gaattaattg 
agaaaaggct 
tttgcattca 
aactccacag 
ttaaaaataa 
tcttagctca 
atgaaggtct 
ctacacagga 
ctcagactat 
gtgtcaccta 



agtgcttttt 
attatttagc 
agacaaagct 
ctttgcttta 
attaatctag 
gatctgctta 
tttcccagaa 
ttgcagccca 
acaccctgca 
gccagacact 
tgtgataatt 
aattatctgt 
gtgatcagta 
attttatgga 
atgcacccat 
tacatcttag 
ctatatttta 
gtaacttgct 
agtagataat 
aaaatagttt 
cattcagagg 
tgatgaaatt 
gactgctgaa 
aatgaaagga 
acacctaata 
atggtcttcc 
taatggttga 
ctgttttgtg 
ttgttctaat 



aaaacagttt 
cactatatta 
caggttaaaa 
tgttttctga 
aactatacca 
tggcaataag 
taagggagct 
gcatgaagaa 
aattcctatg 
tttatcactg 
aaaattatta 
ggttatcagt 
aatcacagaa 
gtctgtttcc 
cttttttaga 
agtcttttaa 
agaatacctt 
tgtactattt 
aaaaagtaca 
ttatagttaa 
taaaaaaaaa 
agtcttgtga 
gtaaggcagc 
agtttgtata 
agttttctat 
tattttatta 
aacttgggca 
tcattatctt 
cacatatttc 



tcttataaag 
ctaagtttac 
gagtttctgg 
aagaaaagca 
aaagtagaca 
cagacttgta 
tcccaaatgc 
gaaaaacaga 
gcagtcctcc 
ggtgtttcag 
ataatcatta 
agaggtctgt 
taaaattctg 
tacaatctgt 
gtataatttt 
ttctgtcttt 
atggtttata 
caattgaata 
tcatgattat 
agtctttctt 
tatgcaatat 
ccactagagg 
taattgataa 
gcaagctgtc 
aatagtaagc 
cagtcttttt 
ccaagcctcc 
gttgcctcta 
acaaatacat 



gctattagga 
atatttttaa 
caaataaaat 
gtaaaaaaca 
tagaaaacga 
ctattcaaca 
aatggtgcac 
gctaggagtt 
tttccataag 
tgttttcatt 
aagaaagaaa 
atgttacccc 
acatctaaac 
atgaaagata 
ataactattt 
tgcttaaagg 
tattaagaaa 
acttaatgta 
tgtattcatt 
tttattgttt 
tttaatattt 
gctctgtgat 
atggtctttg 
ctttcacatt 
actcattaag 
cccactccct 
acaacacagg 
actgccattt 
atttggttgc 



49740 
49800 
49860 
49920 
49980 
50040 
50100 
50160 
50220 
50280 
50340 
50400 
50460 
50520 
50580 
50640 
50700 
50760 
50820 
50880 
50940 
51000 
51060 
51120 
51180 
51240 
51300 
51360 
51420 
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actcgtgagc 
acatttgtcc 
taatggctgt 
aacaacagct 
aataaaattt 
tataatctct 
tcacatggac 
tgctacatct 
tcatgagtta 
taaagatact 
gattcattta 
ataaatgtaa 
acaggaacga 
agagttctct 
atgcccgggg 
gacagcattc 
agcatcaaaa 
acttatagaa 
gacttgcaga 
tccttaccct 
cttagaatat 
ctgattttta 
taaaggtctt 
ataacaattt 
ggatattcca 
aaaaaaacct 
aaaaatatgt 
acagtttttc 



aaatcaaact 
tggtcaaaga 
ttgattggtt 
tttgcaaaag 
gtgctatgtt 
gaaccaaaga 
atgacaaagc 
atttgcctta 
tggtttaaaa 
ggaatcatgt 
tgaataaaat 
agcaaatgtt 
agtagaatca 
actttgttag 
caaaaactag 
caaaagttca 
gtagaatgtt 
cctagtgtca 
acaaactgca 
tttctgtttc 
acattatatt 
agatgtggat 
gataatacag 
aaatgtgaac 
tagcctttga 
aagattttct 
gtgtatatac 
tagaaggaga 



gcattcagga 
gagcaggagg 
tacactgagg 
tattcctttc 
taaaaatatt 
gtggatatga 
ttctaacact 
gtagtagtta 
aaacaatttc 
aatagagact 
agacgctgta 
attttagact 
catttggtga 
agtgtttgag 
acagttaata 
actacaacta 
taacatccaa 
gattgttata 
gaaagtgctt 
agttttattg 
ggtttatcta 
aagctataaa 
atttgggaag 
tgattaaatg 
gaagttttca 
gaaaaagaac 
agtttttata 
agaatcaata 



aagaatacta 
actttaatta 
caatcagaca 
ctttgaagtc 
tgaaaattat 
tttttaaaaa 
gatcatagta 
ttcaactccc 
aatcacttta 
atttaatttg 
ttttctgaaa 

ggggtgtatc 

aattatataa 
ttgaccacca 
aactatgtca 
tagataagat 
gtaactgaaa 
aatgttctat 
gaattaaaac 
gagtgtgaac 
attagttgca 
gcatctctga 
gcttctctgc 
ttgaattaag 
aactatgaga 
atggagtatc 
aagaaaatat 
cagagggtaa 



ttttaatttc 
tgactttatt 
acagagaaaa 
ttattttatt 
tgattaaacc 
tcaaagtggt 
tagctactga 
cttttatcat 
cagtttcctg 
agaaatgctc 
tcattcatag 
tgttccggaa 
gtgtctactg 
tttattttca 
agaattctct 
ttgtttttga 
tcccttgaga 
ccttattagt 
tttaaacatg 
ttaactaaaa 
cctatcattg 
gctaataata 
agtcattgaa 
cccaagtttt 
aattaaaatg 
ttttactaaa 
ttttctacag 
actgctcttg 



ccttggtaaa 
caaggtgagg 
aaaatgcctt 
agcctttaaa 
aatttgtctt 
tttatttaca 
agcatcgaaa 
tgatgctgta 
gattatattt 
tttgagtttg 
tcattatctt 
aaaaaaaaaa 
tttccagctt 
acaaaatcta 
ttcaaactga 
agaaatgaga 
ctagatatat 
cacaacatga 
atataatata 
agaaagatac 
gttttttccc 
actcactaaa 
actccagcca 
agtgattgca 
tacagaggaa 
aaagaacaag 
ttttattacc 
agtcatttgc 



51480 
51540 
51600 
51660 
51720 
51780 
51840 
51900 
51960 
52020 
52080 
52140 
52200 
52260 
52320 
52380 
52440 
52500 
52560 
52620 
52680 
52740 
52800 
52860 
52920 
52980 
53040 
53100 
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catttgaggg 


atggcaaatg 


gagcaagtga 


gcgtactttg 


atttgtagat 


tagagtttga 


53160 


cacataacac 


tttgcttttg 


aatgacattt 


gcttgttact 


gtggagtcag 


tgttcatatc 


53220 


ctttattttc 


aggagttgct 


gctgatacaa 


tggggttaga 


atgagctaaa 


tacagcattt 


53280 


gctttcttgg 


tttgaattct 


gggttttaag 


taaaaatcta 


cttgcctatt 


ccattgattt 


53340 


ttttaattgc 


attcagcaaa 


tccataaact 


gcggagagag 


ctggttgcat 


cacaagaaaa 


53400 


agttgctacc 


ctcacatctc 


agctttcagc 


aaatgtaagt 


cacttcattt 


ttaaaatata 


53460 


ttacaacaaa 


tttttataga 


ggaaaatgaa 


atcattttag 


taacaaactt 


acaaattttc 


53520 


agtgcctgat 


acagacttag 


attaccaact 


agcaggactc 


ataaaaagtt 


aacatttttt 


53580 


gcctactcag 


taataaaatg 


taaatccaaa 


ctgatgagag 


gcagcaatat 


ggttaaaatg 


53640 


gcttgttgtt 


tctaataaga 


ttggaaacaa 


tagtaacagc 


catatgggtt 


acttcttttc 


53700 


ttgtttgcta 


tttttattac 


tcctcttgca 


taagattccc 


tgacaatgta 


agaggggttg 


53760 


ttagtgtttg 


actttggaag 


ataaaatatt 


cctgtgccca 


gcctccttca 


tctcaatgta 


53820 


ttgaacaatt 


tgttaagcat 


ccagttaatt 


ctaaaatatg 


aaattaggtc 


taaataggga 


53880 


tagcttagct 


gcactgtgga 


tgagatatgg 


tttgctcaaa 


aaaccttggc 


agccttctca 


53940 


tagcaattta 


aaagggtaca 


cttttactgg 


caccagagca 


gcccaggatg 


gcagaaatga 


54000 


tgacaatgaa 


gaccgtcaat 


taaattaaca 


tttactgaat 


atcttccact 


gtgtcaggga 


54060 


gcactcagag 


tagatgcaga 


atgataaagg 


agaaatgtgg 


cactgttccc 


agtcctgagg 


54120 


agcaatggtg 


ttaagaacag 


cagtgagggg 


taaggaaatg 


cctgctattt 


tgccatatgt 


54180 


cttacctctc 


tcactcaaca 


gtcctttgct 


cagttctgct 


gcatagcttt 


gggcctgctc 


54240 


tgtgcctccc 


cacccctccc 


actgctcctc 


tactgagttt 


ttctatctcc 


tagacaaagc 


54300 


atgatatgtc 


aagagtgagc 


aggtgcagac 


ccacagtgta 


agacttgaat 


aagagccatt 


54360 


tttaaatttt 


ttttaagcta 


tcattgtgca 


atataaattc 


taagtatgtg 


tatcatttca 


54420 


ttcacaatgt 


attcatttta 


gcactgtatt 


tgaattgatt 


ttattttctg 


aaatttggga 


54480 


gaattaattt 


tggatttatt 


ctatttattt 


ttaatagatg 


gtgttaggag 


attcctgaaa 


54540 
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caatatgaga 


aaataagggt 


attatttaac 


54600 


cttgttgtgt 


ttttaaagag 


atagtccaga 


ggcaaccgta 


aattttataa 


tataggctac 


54660 


atgtatagaa 


gtatgaaata 


ttgttgtcta 


ggttcctgaa 


tttgtaccca 


gaggaagtag 


54720 


aataatgtaa 


atgtcagaac 


ctcctgggtt 


gtgtttatct 


gcaataagaa 


aggctcaatg 


54780 


gcaaacctta 


tttattagat 


tgtcaggata 


cttgcagatg 


tcttgaatga 


ttactcaggg 


54840 
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tttcatttta 


tttttaatgt 


cccttggttg 


agctcatcat 


ataattcaga 


tattggaata 


54900 


ataaatggct 


gctagacata 


gtggaagatg 


ggctgatact 


ttccatttga 


aatgtaatga 


54960 


tgcttattgt 


cttcaaaaga 


aaaaactaaa 


atggtatttc 


acattttttt 


gtttttgttt 


55020 


ttgttttttt 


ttctctgaga 


atctcattct 


tactcatgat 


tattggtttc 


ttgtgtacca 


55080 


tttcaacatt 


tttctattat 


atgctaatgt 


gtatatatac 


ttaatacaca 


cgtgcaaaag 


55140 


cttccacaca 


cacacacaca 


cacacacaca 


cacacacaca 


cacacataca 


cacacatacg 


55200 


gaaccaaatt 


ctaacatagg 


ggaataatct 


tcggagtgaa 


ctctgtgctg 


ctgtttgaaa 


55260 


atggagatat 


aattttagaa 


aggttcctgc 


agttggctac 


ccacctcgtc 


tgctctaatt 


55320 


atgcttgtca 


cactattttc 


actgatgtgt 


tttcatgact 


ttagggcatg 


aattctcagc 


55380 


tgggtgttaa 


tatgaccaac 


aaagggtgaa 


aacaggttct 


tgcatttttt 


taagtactct 


55440 


ttttatgtga 


aaagcacaga 


tatgeagata 


atacataact 


gaacatccag 


catatctgtg 


55500 


gctttaaaat 


atcacgaaga 


agagcacaat 


tagggaaaag 


aaaacatcta 


tagtgtttcc 


55560 


ctaggggaac 


aatcatttaa 


aaaaaaataa 


aaataaggaa 


cacagactag 


aagcagcagt 


55620 


gecaaataga 


taattcatgc 


tagtctttgt 


gttaatttaa 


aaagtgctag 


tcttggagac 


55680 


aaacgcccaa 


attgetctag 


gttccactca 


gctgtatgtg 


ttatcattag 


tattaacttt 


55740 


tgcacgctga 


tgggagactg 


atatatatcc 


tgttttatgt 


tcctttaaac 


aatttataat 


55800 


gtaatttaga 


aaccttctca 


aatcacatta 


gatccacaca 


aaaacctgta 


catagcagct 


55860 


ttatttttta 


atagecaaag 


aaaggaaaca 


accaaaaata 


tcccttaata 


ggccagttaa 


55920 


taaacaaatt 


ctgatacatc 


tatatcatgg 


actactactc 


agcaatataa 


agaaatgact 


55980 


attgatacgt 


gcatcaactt 


gggtggatcc 


caggggtatt 


atgctgagtg 


aaaaaagaca 


56040 


gttatagaag 


gtcaaatttt 


gtataattcc 


atttatataa 


cattccagaa 


atggcaaaat 


56100 


taaagaaaca 


gagaacagat 


tagtgattgc 


taagggctaa 


ggatgaagga 


gagagagagg 


56160 


tagtgtgact 


ataggaagag 


ggagatcttt 


agttttgtat 


tttgaatgag 


atggccatca 


56220 


catgaatcca 


catatgtcaa 


tctattaatg 


taaatcaata 


ttgtattcct 


ggctttgata 


56280 


tataatataa 


ttttataaga 


tatataatca 


ttgggggaaa 


ctggatgaag 


gatacaaggg 


56340 


acctccctgt 


actatctttg 


caacttcttg 


tgtatataat 


tataaaatat 


ataatgtatt 


56400 


aaaatgtata 


aaataatatt 


ttaagtatca 


gatactgatc 


tttactcagt 


atatgaagtg 


56460 


ttctatcata 


aegtaacatg 


cttttccttt 


atttgtggta 


ttttagtttc 


aaactaaaat 


56520 
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ataaatcacc 
aggagtttca 
caaagcttct 
aaaaatgtac 
ctacttttaa 
tatataaata 
aacattttaa 
ttgttcttaa 
ctattaatta 
ttttagattt 
tcagttgtaa 
atatctgtta 
tagggacttg 
aatctacagt 
caagtaaact 
atattagaac 
aaataatctg 
gtgacatgta 
aattggtata 
acttgtattt 
tccctttgcc 
ttgactggca 
gtaaagttgt 
tagcatgtcc 
atcaataact 
aatggtaata 
gctgggtcag 
cattacataa 
ctgaatgaat 



taaagatcta 
accttaatcc 
taccatcaaa 
aaattgaatg 
aaaaaagttt 
gatatgtata 
taatattgtt 
ggaaataatc 
caagtcattt 
catgcatcat 
gaagtgctgt 
atgtatgtaa 
gtaaaattac 
tcccatttga 
ttctaccttt 
tctaatagct 
ttggttttaa 
ctctcatagc 
tcgcatattt 
taatttttaa 
caaatggggt 
gagtaaagcc 
tctgaacaga 
ttagaaagga 
caccaacaaa 
gcaaacaata 
gtgggatgct 
tttggaaatg 
gtgtaagaaa 



cgacagttct 
tctctttagt 
tacatttact 
acagtcaaaa 
attaaaattc 
taaatgacta 
ataactatgg 
atttttacat 
tgaactcatt 
aatcagccat 
gtttaaattt 
tatacacata 
ttatttatag 
aagtaaaagt 
tggttaaatg 
ataatttata 
caacattttt 
ttacgtgctt 
aaacttggca 
atttgaaatg 
atggtaagtc 
cttgttcagt 
gctgcataaa 
gagcggttat 
cagaaaacag 
agccagatag 
tttgtttgtc 
gacacaaggc 
acaaaaggtc 



tttgaaaaaa 

gtttctttat 

gactaaaaat 

ggtacaggta 

tcttttagac 

ttaaacaatt 

gggtaaaatt 

atccaggaat 

cattttcttt 

caaataattt 

cattcagaat 

tttttaacat 

gatattttaa 

aagtctttgt 

tgagtgcagc 

gggatgaatt 

aagtatcaga 

ttcctttatt 

taattacatt 

catctatgtc 

aagagagtct 

agaatgtgtg 

accacaggta 

attggcaggt 

aagccgcaca 

cctctggcct 

ttttaacgta 

tagttattac 

ctttttgcct 



aatcttgctt 
ttggtagtga 
agaaaaataa 
atgaagatat 
taatgcagta 
ttaatgtcag 
ttgtatatat 
ttgaattact 
gtgtttgctt 
agttaatact 
gtttcatttc 
gcatgtactt 
atataatcaa 
ttactagttt 
ccccacagtg 
tcaatgagtt 
tattcatctt 
tggggtgttt 
tatatggact 
tctgttaaaa 
ctagttagct 
ttaagccttc 
aagtgttaag 
cctattgcct 
aggaaaggca 
ctcgcccaca 
ttttctttac 
taacattttt 
ttcagcagat 



ttaatttccc 
tagggactat 
tttacattgt 
gcattaacat 
tctgggaatt 
ttatatttta 
ctgaacattt 
ctcaagtcac 
tataatgtca 
tgatttttcc 
atctgaatta 
aaattgatta 
ggatttttta 
gttcacagta 
agaaattgtt 
tggttctaag 
tactcagtat 
tttatatatt 
ctaaacaata 
tgcatttctt 
cacctctcat 
cctccctttt 
ctgattctac 
ggcgtttctg 
gaactaaata 
ccttaaggca 
aaatctcagc 
aaagacatta 
aagtctttta 



56580 
56640 
56700 
56760 
56820 
56880 
56940 
57000 
57060 
57120 
57180 
57240 
57300 
57360 
57420 
57480 
57540 
57600 
57660 
57720 
57780 
57840 
57900 
57960 
58020 
58080 
58140 
58200 
58260 
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accaaaaatc 
cataaaattt 
catattatta 
tgaaacttta 
gtcttaaata 
tatttataca 
taataaaatg 
tcataacata 
tatatgcacc 
ctactatata 
ttactatata 
ttgtcagtgg 
tgatatttca 
cttttgcacc 
gatattccct 
acacaagtca 
tgcaactgct 
tccatatttt 
tttaatcagt 
aaaaattttc 
ctcataactt 
aagacaatat 
tttgttacat 
gatattaagc 
ctgacaggcc 
tcccacttat 
ggataatggc 
tagctgcata 



tcttgggtat 
gctagttact 
atgagaaaca 
tgttgcagaa 
tatataaagt 
taatatacta 
ctctaggcat 
catatataaa 
tatatattta 
ttactatata 
atatataaat 
gctgtctctg 
tcttgttgtg 
agcctaatac 
ttactaatac 

ggggatattt 

aaaggaaaaa 
ttaaaagtca 
tttactttcc 
ccttatttgc 
ctaaagtagc 
tttttttctt 
gggtcaacat 
ccaccatcca 
ctagtgtgtg 
aagtgagaag 
ttccagctcc 
gtattccatg 



tttgagattg 
cttgcttttt 
cttcagtgtt 
tcattcttgg 
ttatatgggt 
tatatttata 
atatgtgtgt 
atactatatt 
tatattacta 
atatatagta 
atatgtgtgt 
taatctatat 
tggcatcata 
attgtatgat 
tcagtcttgt 
atcatcatat 
tcagaaccaa 
ttttatcaag 
attttctctt 
ctccttttcc 
aatgttaatc 
caacttttaa 
atgccatagt 
ttagctattc 
ttgttcccca 
aagtggtgtt 
atccatgtcc 
gtgtatatgt 



tgttctactt 
tgcatctctt 
tggacaattt 
ttcaactaac 
aaatatatat 
catgatatac 
gtgtgtatat 
atatatactc 
tataatatat 
tatatatagt 
atatatatat 
gaagaaataa 
gtaattctct 
tccaaaacca 
ctattttatt 
taatacaatt 
ataaactgtc 
tctctcagac 
cattaaggtg 
atttacccta 
tgataaacta 
gttctggcgt 
gatttgctgc 
ttcctgattc 
ccatgtgccc 
tggttttctc 
ctccaaagga 
accacatttt 



ctttgcttat 
ctaagagaaa 
tttgtagtgg 
tactaatttt 
tacatataat 
taaatatttt 
atgtatatat 
taggtataca 
agtatatatt 
atatattata 
atgcctagag 
aatgtagacg 
ttacatatct 
aagagagtat 
acctttatag 
ttactctgac 
attaacaacc 
aagatgtgat 
ataaagatta 
ttgagtgaga 
aaccaaggtg 
acatgggcag 
acagatcaac 
tctccctccc 
acgtgttctc 
ttcctgtgtt 
catgacctca 
ctttatccag 



ttaatatttt 
acaattggtg 
aaaagaaatg 
aaaacataaa 
atatgtttta 
cccatataaa 
atatatacct 
tatatgccta 
actatatata 
tagtaatata 
tgtttttaat 
ttatgtataa 
attcagatta 
ggattgaaat 
acttcaccta 
cttaaaatta 
cccctgaaaa 
accctataag 
tcattagtag 
aatttagcct 
agataaattt 
gatatgcagg 
tcatcgccta 
ctaactccca 
atcgttctac 
agtttgctga 
ttccttttta 
tttatcattg 



58320 
58380 
58440 
58500 
58560 
58620 
58680 
58740 
58800 
58860 
58920 
58980 
59040 
59100 
59160 
59220 
59280 
59340 
59400 
59460 
59520 
59580 
59640 
59700 
59760 
59820 
59880 
59940 
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gcatttgggt 
tgcaggtatc 
ttactgggtc 
gttgaactaa 
tgcagcacct 
ctcattgtgg 
cttgtaagtg 
gtttgttttt 
gtcaaatgga 
gatagtttct 
ttgttgcaat 
taatattgcc 
ttaatccatc 
ctgtatatgg 
ttgcttgttt 
aatcataaca 
ataagttttt 
ttatagctag 
ctttgtcaaa 
aattttcttc 
ccactataag 
tgctaataaa 
ttttaggttt 
gaagtgacca 
taaaaactaa 
gaaaaccaca 
tctaaacata 
aaataattac 
gagagggaca 



tgatttcatg 
tttataatag 
aatttctgct 
tttactctcc 
gttatttctt 
ttttgatgtt 
tctattcata 
tacttgcgca 
tagattccac 
tttgctatgc 
tgcttttgga 
cagattttgt 
ttgggttaat 
ctagtcagtt 
ttgtcaggtt 
tacatttcat 
ttgtgcataa 
aaaatgtgag 
ctttaattca 
aagtttcaca 
gagaattaaa 
gggtttctgc 
tagtaaccaa 
tcatgggcat 
cttatgctac 
aaagaatcat 
gtgctttaaa 
cacagaagtt 
aagaagagct 



tctttgctat 
aattatttat 
tccagatctt 
caccaacagt 
gactttaata 
accctttttt 
tcctgtctat 
tttgttgaag 
aaatgttctc 
tgaaggtctt 
gtttttgtca 
tctagggttt 
ttttgtataa 
ctaccagcac 
tgtcgaagat 
tgaaaacaac 
ttacaaactg 
taatatagtt 
tattaaattg 
tttaaggatt 
atggaaatat 
cataattttc 
ttttatgggc 
atatatatat 
ataccacatg 
gccatttgat 
gtgtaagctt 
atgtagaggt 
tcttggttct 



tgtgactagt 
attcctttgg 
tgaggaatca 
gtaaaagcat 
atcaccattc 
ttatatgttt 
tcatgtcttt 
ttccttgtag 
ccattctgca 
taattagatc 
taaaatcttt 
ttatagtttt 
ggtataagga 
catttattaa 
cagatggttg 
acgactcaaa 
ccattctaat 
tatacagcat 
ataaagtata 
catagtagaa 
tgagtaaaat 
cttattttaa 
atagtgggaa 
tttaaattca 
gatgaatctc 
tacacttggg 
gggtaggaaa 
tatctttggg 
tgaaatgtcc 



gctgcagtga 
gtatataccc 
tcacactgtc 
tcctttttct 
tgactgctgt 
gttggctgca 
gcccactttt 
actctagata 
gattgtctgt 
ctatttgtca 
gcccttacct 
tggattttac 
agtggtccag 
ttgttttttc 
taggtgtttt 
atgttcttta 
cataaacatt 
actctttaca 
cacaaagggt 
tgattaaacc 
cttacatttc 
aagaaaacac 
tatttctaac 
catatatgaa 
aaaacccatg 
tggtttttaa 
aactataaag 
gaaggaagag 
tatttcttga 



acataatgca 
agtaatggga 
ttccacattg 
ctgaaacctc 
gagatggtat 
tgactgtctt 
taatggggaa 
ttagaccttt 
tcactctgat 
acttttgctt 
atgtcttgaa 
ttgtaagtct 
ttttaatttt 
agtttcccca 
tcactaacat 
gtaaccagtt 
ttgtggttac 
atcccgattt 
aaaggagagt 
ttacatttct 
atttagtaag 
acaattttag 
aggttaaact 
tactatacag 
taaagcaaaa 
aacaggcata 
aaaagcaaga 
ggaataataa 
cttggctggt 



60000 
60060 
60120 
60180 
60240 
60300 
60360 
60420 
60480 
60540 
60600 
60660 
60720 
60780 
60840 
60900 
60960 
61020 
61080 
61140 
61200 
61260 
61320 
61380 
61440 
61500 
61560 
61620 
61680 
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gaatgcatga 


atgttcacta 


tgtgataagt 


cagggggctg 


ttttcatttt 


gttcactttt 


61740 


atatatgtgt 


ggatttttcc 


acagttgaaa 


aggtaaagtt 


caggtgtggt 


ggctcacacc 


61800 


tataatccca 


gccaacactt 


tgcggggcca 


aggtgggaag 


aattacttga 


ggctaggagt 


61860 


tggagagtaa 


cccaggcaac 


agggtgaggc 


actgtctcta 


cagaaaatga 


aaaaaaaaaa 


61920 


aaaaaagtag 


ctgggcatgt 


tggtacatgc 


ctatagttct 


tgctacttgg 


gaggctgagg 


61980 


caagaggatc 


actttagccc 


aggagtttaa 


gcctgcagtg 


aactagggtt 


gtggcactgc 


62040 


actccagcct 


gggtggcagc 


aagacactga 


gtaaaagaat 


aaaataaata 


attaaaagtt 


62100 


aaaatatagg 


aaaaaatgag 


catagcctta 


tgctaatttt 


tcagttacta 


ggtctgatat 


62160 


catcacattc 


cttgcttgtc 


attgaaaatt 


ttttaaacta 


tgatactttt 


ttttagtggt 


62220 


atttatccaa 


ttaaatctgc 


taacaaattt 


ggtgtataaa 


tctcaagggt 


aagggtatgt 


62280 


ggagagtggg 


tgtgtttgtg 


tgagagagag 


agagagaaga 


gggggaggag 


aaaaagaagg 


62340 


aagagggaag 


gaatggaaaa 


agataataaa 


gagttgttct 


gatagattaa 


tctttagtag 


62400 


atgtattccc 


tacaaattgt 


ttttctccat 


attgcagtgt 


caggtaaaga 


aaggcatccc 


62460 


aggatgaatt 


cagagctagg 


aacatgcacc 


tttgtatcat 


aatgctaatg 


gaaggaacat 


62520 


gtacattcta 


actgttacca 


ataatggaat 


atatttccgt 


tattaagtaa 


taagctttaa 


62580 


ttctttgtat 


ttttgtgatc 


catttgatag 


taggtgcctc 


agcatttcca 


ctctgctata 


62640 


agtacatgga 


gatatatttt 


atttaagtca 


tcttattcat 


gtctttcaaa 


aagaaattca 


62700 


tttttggcca 


aggatttcca 


aattttgccc 


catatatagg 


tatagtttat 


tatagacttc 


62760 


gtttgcaaaa 


tattaaatcc 


ttatatcctt 


ttagggacac 


aataaaattt 


tataagtttg 


62820 


agataatgta 


cttgcagttc 


tacctcaggc 


cgtggtgaga 


gattgaagtg 


cctcttcatt 


62880 


ttaacatttt 


gggttcaagt 


tgttgcataa 


gggcatgcaa 


atggaaactg 


gcctattttt 


62940 


gagctttaat 


aaaatcgtca 


aatacttctt 


aatcttaaga 


gttatagtta 


tgtactacaa 


63000 


tatgtataat 


tctctaatat 


ttaaaacaaa 


acctgaaagc 


cacaaaagct 


tactgtgaaa 


63060 


taaaatgtga 


tggaatatta 


tttctaactg 


gcttacctgt 


atttccttca 


ttgaagggaa 


63120 


tatgaagtag 


aaaagccctt 


ttattgaaaa 


gagtttggaa 


agtaaagata 


actcttttca 


63180 


attcaattct 


ttgtaagtag 


aaaaagagta 


aagataatgt 


ttagctgtca 


gcagatgtct 


63240 


gacacttgat 


ggagcgtatc 


attacaatag 


agcagctaac 


aatatctgca 


aaggtcatca 


63300 


tgaaagtata 


aaaatgagga 


atatttgtcc 


attgaccatt 


tcagtgacct 


ctttttgggc 


63360 
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tttaagtcta 
tctacatgat 
gttaaactca 
aaaagatgaa 
catagtattt 
atccatccac 
accttgtagc 
ctatgacagc 
caaattgcat 
agtgcagagg 
taaggtcagg 
cacacacaaa 
gctgaggcag 
ccactgcact 
tcaattctat 
attttatcag 
tctattaatt 
cagtcacata 
tgataagaac 
taagacacac 
ctatgaaaca 
ggatatcacc 
gattttatgg 
agacaaatgt 
accagttagc 
agagataggt 
gtgtgtgtgt 
tttcatctaa 
gttcctctat 



aaaatcttgg 
gtgcaagtca 
cattaaatgc 
tgaaaatgtg 
aaagagttgg 
ataagattct 
agcttttgaa 
ggaacaaaag 
aggattctta 
ctcatgcctg 
agttcgagac 
aaattagctg 
gagaatcact 
ccagcctggg 
actaaattaa 
tttactccat 
tacttctgga 
agagaatatt 
tttcacaaat 
aggaaggatg 
aaggccaatc 
tttgaagaaa 
tatgaaagaa 
attaaaaagg 
ttacttctct 
gaaacctata 
gtgtgtgtat 
ttctcaaaga 
ctatccatag 



cagatcagaa 
gaaggagtta 
caggagactg 
tgttaaagaa 
aacacatgaa 
gacatttaaa 
aagagcttag 
gtatgttcag 
aataatacct 
taatctcacc 
cagcctggcc 
ggcatggtgg 
tgaacccagg 
agacagaaca 
aacaatgata 
attggaacac 
tggaactagt 
ctgggggcaa 
acatggtcac 
cctgggactc 
ctgtgtaatt 
agtcatttgt 
ggtataccaa 
tccatactgc 
gctgttagtc 
gaattctatg 
gaaaactgta 
gacccattat 
gaatggcttc 



ctttatattc 
ttacttgcaa 
aaacactgat 
gagtcaccag 
attaagcatt 
ctatgtttct 
ggaatatgac 
aaattgccac 
gaagctcctt 
actttgggag 
aacgtggtaa 
cgggtacctg 
aggcagagga 
aaaaaaagag 
atacctttct 
acaaaggcaa 
aaaagaatac 
actgttatgc 
taaatccagc 
tgccaagtaa 
ttgaaataac 
tatatcaaaa 
aaagaaagga 
atagaaagcc 
cagtggcctt 
gagtgtgtgt 
aatgtgcata 
atcagaagtt 
agttttgtct 



ggcattttga 
aataccatct 
tttaagaaga 
tcagagctaa 
ttgtaaaatg 
tccattctgt 
tggccgattg 
tggagactga 
aaaaataata 
accaaggtgg 
aatcccatct 
taatcccagc 
cgcagtgagc 
taataataat 
tttcagattt 
acaaaatcct 
tgaatgttaa 
agttgacaag 
tatagggcat 
gggacttcag 
aagaactagt 
tacttaaaat 
acggagaatt 
tggtcacctt 
aacttccttg 
gtgtgtgtgt 
aatgatcagg 
ttgggtattt 
ttagattctg 



gtgtcaaatc 
tctttcagaa 
caaagtttag 
ctatgatagt 
aaggcttttc 
tcacaggctc 
caaagtctaa 
aagaagacag 
ttccaggctg 
gtggatcact 
ctactaaaaa 
tacgcaggag 
caagatcaca 
aaaataatat 
taatttaaag 
tgctgggcag 
gaaagagaaa 
aatcacactt 
ggctgtaggc 
gttacagcag 
tgccatctag 
gaacctaaag 
tagttcacga 
tcctgtgatg 
gataggtatc 
gtgcgtgcgt 
tgtccagagc 
tcaagaatgc 
taagttatgt 



63420 
63480 
63540 
63600 
63660 
63720 
63780 
63840 
63900 
63960 
64020 
64080 
64140 
64200 
64260 
64320 
64380 
64440 
64500 
64560 
64620 
64680 
64740 
64800 
64860 
64920 
64980 
65040 
65100 
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gattagcttt 
ttaaaacaga 
ccaagtatta 
gagaatatta 
cataaatttt 
ttttctttcc 
cttccttctt 
ctatgtaata 
aagaataaaa 
agctgggtaa 
aaagtgggga 
tatcatgtca 
cccttgaaaa 
ggaaatgagt 
ttgtcctcca 
aaacatacag 
taatgcctat 
taatgtgtct 
tatcagaatg 
ttgaattgca 
ataacttgcc 
tggtataatt 
cctcagccta 
tggtacaaag 
tttagactga 
tttaaattgg 
aaaagctaag 
gataaaaggg 



acaaaagtag 
atgaatagtt 
ttagcacatt 
tccattcctg 
cctttctttc 
ttccttcctg 
ttccccctct 
taagaaccca 
acactggccc 
tcttgagcaa 
tgataatgac 
catgctaggg 
ttataagata 
gttagaattt 
cccaatattt 
agaattttct 
gattaatgcc 
acataatcct 
tctcatattg 
aactgtatac 
tattctcaaa 
caaagacaga 
gataattact 
ggagattgtt 
ggagaaaact 
ggtcatggaa 
ggcataacat 
tatattgctc 



tatgtattac 
caatgaaatc 
gtattctctc 
gaaaatctgt 
ctccctccct 
cctcttttct 
gtctccctct 
gcaaatagaa 
ccagtattct 
gtttatctaa 
taccttgtag 
gctaccaaaa 
acaccaaatt 
aagtctcaat 
tcttgaactc 
catatacata 
tgaatattta 
taaaataaca 
tccttatcac 
acacaacaat 
gtgatattta 
atctaattaa 
actgcaaggc 
aggtgtgttt 
ctttggcact 
taacaaaaga 
ttaatcatat 
tctggtgatt 



caaattttgt 
aaaagagtaa 
tcatattctc 
tctagcacag 
ccctccttcc 
atccttcctt 
ctttcttttt 
ttagaaggct 
tgaatgagaa 
cctttcttga 
gatcaccatg 
aatattacct 
cctcactggg 
atctttaata 
tgttatatct 
tagaaaaaaa 
aaaataattt 
ttgccaaaat 
ttttttaact 
catggttaac 
aaagcctata 
aaccagtaga 
actgaaatga 
tattctcatt 
aaattcaagg 
caaaatcact 
ttcactaaag 
caacaatcct 



cactttacaa 
atcgaatatt 
cgtataccct 
ctaacaaact 
ctcccttcct 
tctcctccct 
tgctgcagct 
ttttagagca 
ttctggctat 
acctcaaatt 
aggagtaaat 
tcctttacat 
catataccaa 
agtcaaaatt 
gtaagtgaat 
tgtagaggta 
ctataacata 
tataaaattt 
gaaaataaaa 
tagtttatta 
aaattatttg 
ataatgtata 
attgaatttc 
ttctgaccag 
acgaatttat 
gttcaaatag 
gcatttcttc 
gagaaaaggc 



aagtttattt 
cttataattg 
gcccgtgaga 
ccttttgaaa 
tcctttttcc 
tacacccttt 
tgtcacttca 
gctgacggga 
gtctgttaaa 
caccttctta 
cagatactgt; 
ttctcttttt 
gcatattgtt 
aatagaattt 
tttctcatag 
tgttaatgta 
agagatttta 
tctcagaaga 
tcacttcttt 
atttgagatt 
caatgtgaaa 
taacaatata 
aaggaagcta 
gagagcataa 
tgccaaggtt 
acatttctct 
agggagctga 
ttgtgaagta 



65160 
65220 
65280 
65340 
65400 
65460 
65520 
65580 
65640 
65700 
65760 
65820 
65880 
65940 
66000 
66060 
66120 
66180 
66240 
66300 
66360 
66420 
66480 
66540 
66600 
66660 
66720 
66780 
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tagagcagag 
attcatcaag 
gatatagaat 
aagcatatta 
atatatatgt 
aatataatta 
gccaaggcag 
aaccccatct 
agctattcag 
agccgaggtc 
aaaatatata 
tataaattac 
ggatttcagg 
aagctggttt 
ttacatactt 
cagtgtgatt 
ttgatttata 
ctatttccag 
ttttatatac 
aaaattatca 
cagaggcagg 
cactgcactc 
aaataaaaaa 
cccaatagct 
taagagaatt 
ctaacctagg 
tttcagatcg 
cctgatgccc 
atggtactag 



attcttaaac 
tttatattga 
gatagatata 
tacatgtatg 
atctctttat 
aaacaggcca 
gtggactccc 
ctactagaaa 
gaggttgagg 
atgccactgc 
tatataatat 
atatttataa 
tgatgataag 
ggaagctaaa 
taaaaatgga 
ttcagcaatc 
gaccaaaata 
atgccaaata 
ttatagcctg 
tgctattaga 
agaattgctt 
cagcctgggt 
taaaaaatag 
gctaaaagaa 
gttaatgtgc 
actcagtaga 
ttttctcttc 
tgaggcatcc 
ctatgtacca 



tcccttcccc 
atttgtgctc 
ggtctatatc 
taatattcct 
catggagttt 
ggtgtggtga 
tgagatcagg 
caaaaattag 
caggagaatc 
actccagcct 
atataatata 
atatgtaatt 
cactactgaa 
acacaaagta 
taaactaaat 
acatggcatt 
actcccttga 
acacggaatc 
gtaagaaaga 
ctcagtacaa 
gaaccctgga 
gacagagaga 
tgatcaatct 
gacatgttat 
ttgctgggga 
ctatgatatt 
accagcactt 
attatatgtg 
ctcttgtcag 



aagttataag 
ttctaatgac 
tatagacata 
catggagttt 
atattttagg 
ctcacacgtg 
agttcaagac 
ccaggcctgg 
acttgaacct 
gggcaacaga 
tatataaata 
tatatatata 
aaaagtaaag 
acaaaggcca 
taagactcag 
aatagcctga 
atagagaggg 
tcttgccaaa 
aaagacatgt 
tgtcatgtgc 
ggtgaaggtt 
gactgcatct 
ggcagcattt 
ataatactaa 
gtgaaattat 
ggcgtagttt 
cttcaccagg 
ctgaaataac 
aatcaaggaa 



tttcatttgt 
aaaacagtac 
cctatctact 
atatttctca 
aggtcacaga 
taatcctagc 
cagcctggcc 
tggtgggcac 
gggaggtgga 
gcaagactct 
tatatattat 
atatataatt 
ctgagaatga 
aggtggttac 
attctagtct 
aactacatca 
attcactcct 
tttgtgtggc 
atgaataact 
attctcaaag 
gcactgagct 
caaaaaaata 
tctgaaagtt 
gtctgtaagt 
ctctaggcat 
gaccaagaat 
attatatgaa 
ttcttttctc 
attgctactc 



ctatatagtc 
agacaatata 
agaactctaa 
tatatatctc 
tgataataaa 
actttgaaag 
aacatagtga 
ctgtagtccc 
ggttgcagta 
gtctcaaaaa 
ataatatata 
aaaacatata 
ggatactgag 
atgttcttga 
ttgggcttca 
aaattgtcat 
aacacttttc 
agaacactgg 
tagaaggcag 
gaaacatctg 
gagatcatgc 
aaaattacaa 
aagcagtatt 
aggtaaaaat 
taccctatac 
tttatcctga 
aaaaattaaa 
aeeatctaga 
aaatcattgt 



66840 
66900 
66960 
67020 
67080 
67140 
67200 
67260 
67320 
67380 
67440 
67500 
67560 
67620 
67680 
67740 
67800 
67860 
67920 
67980 
68040 
68100 
68160 
68220 
68280 
68340 
68400 
68460 
68520 
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gcagcttaat 
aactatattt 
ttactaaagc 
ttagttccct 
tttcttgtta 
cagatattga 
gaaacatttt 
aaaccaaatc 
acattaactc 
atattgetgg 
aaaattttca 
tgcaataggg 
ggagtgggga 
agtgccaatg 
aaggtcaagt 
agtctttgtt 
aggccaccag 
teaagectga 
acattgactc 
aagctcagct 
cttagtttgg 
atatgatctg 
tctttagtat 
aatccatatc 
tctcaaaact 
tttaaaaatt 
cctcatccaa 
ggecacatta 



tttctcacag 
tttataagta 
tatattcaca 
gtagctgtca 
gaaggtcaga 
ctcctatgct 
tttgggtgac 
tttgaaaata 
tccctaattt 
agtcagatga 
aattatttac 
atggccaggt 
aaggaggacc 
tgaggaccta 
tagttgatga 
agtgagaaat 
gaaaggcgaa 
cctgaacgaa 
aatttagctt 
ggtgttgggg 
ttctcaggtt 
attatcttat 
ttaatgagaa 
ttgatggcat 
tgagtatctt 
tctaatgatt 
actccccact 
caatttctat 



aaggccagtt 
acatttttac 
gctaaatatt 
tgecaaggea 
gcagggcaga 
aaccataaag 
tacaaaacag 
gataatgetg 
aegttatgat 
tgcatccatt 
actactggta 
tatttcagta 
tgtttaagga 
aaaagcagat 
cacttgatga 
ttggaaagag 
tattctgaca 
ttgtttagcc 
ctttaacatg 
gcatcattat 
ttaaaatatc 
tcctaaaatc 
taagaaactg 
atacattttt 
ctatgtttcc 
ttatagttag 
atggacaaaa 
gcattataga 



gagaaaggct 
agaactactt 
tcagaattta 
attagaacat 
gaagtagece 
cctagtagtt 
gaattgaaac 
caactaaaaa 
atttgecate 
aatctttggg 
tttggtcaat 
gaacaactag 
aactagagct 
gtggtggaaa 
gattgtccca 
atcaggtata 
caaaatttga 
tcagatacat 
agaaactttc 
cttgaatagc 
tatcattttt 
etttgeagaa 
gaaatgactg 
cagttttttt 
cttcaacata 
ctatcttggg 
ataaaataaa 
aacctgagaa 



caacttctag 
ctaaatcctt 
aaatttaaaa 
atgttaaggt 
cttgtatgag 
tgctcatttg 
cttcaaaata 
tttagttgaa 
tagaagtgtt 
gcatagaata 
gtaatttatt 
caagacttca 
gggaagtgtg 
atttaaacag 
agctttggga 
gttaagaaac 
tcattttatt 
gcataaaact 
gtggaaaact 
tcactggagg 
gaaaagtgtg 
ttatcccagc 
aattggaaga 
tctaaatgat 
aagaaattgt 
aattcatttc 
aattattagt 
aatgtatctt 



gaatccagca 
gtgttcaaat 
gactttcaaa 
atgaggggtt 
tgatgaagct 
ttacctctct 
agggaatttg 
taagattttt 
tttaaaaaat 
atgtgaatct 
tgaaactaga 
gatgcatggt 
agattaactt 
gettgectag 
ttctcaacaa 
tgggttggaa 
tggaagcatt 
gtgaaaagag 
agaactttac 
aaaatgaaat 
aagtaacaaa 
ctcaatcttc 
gtagacttta 
taatgaggat 
atgaaaatat 
taatcatgta 
tgcatctgaa 
aaaaaataaa 



68580 
68640 
68700 
68760 
68820 
68880 
68940 
69000 
69060 
69120 
69180 
69240 
69300 
69360 
69420 
69480 
69540 
69600 
69660 
69720 
69780 
69840 
69900 
69960 
70020 
70080 
70140 
70200 
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tgtgaacaac 


taaccataat 


tatgaagaag 


aaaaatgaaa 


actagaaata 


aactattgaa 


70260 


aaatgtctat 


gtatcagtta 


agtttttatt 


ttaaaattct 


ttatgtttat 


ctctataata 


70320 


ctattgggaa 


agagagaaag 


gaaaacctga 


ctttgttctc 


atccaaagga 


ggtgattcca 


70380 


ctgatttagc 


caaaataaga 


cttcctggtt 


ataataaata 


ataaagtttt 


tgatgttttt 


70440 


tatatggtac 


cccactcact 


aggtgatcag 


acaccctcct 


gcaaaaaaaa 


aaaaaatacg 


70500 


tatgcaataa 


agttaaagtt 


ttatgttatt 


ctttcaaggg 


gagaaacatc 


tgtttaacac 


70560 


agaccagaat 


atttcaacaa 


agtcatccca 


atatttatgg 


agatcataaa 


tcaagcgaaa 


70620 


aaatatattc 


atcaacaact 


aaacaaacta 


cattaaatag 


tctcaaagca 


cattttcact 


70680 


ttttttctga 


caggaaaaca 


ggtttcacaa 


gtgtggagac 


attttaccat 


ggcttttaac 


70740 


agtgaggaag 


gatgtttaaa 


taaagggaaa 


aattatatgg 


aaagctcaga 


gaaaagagat 


70800 


gggtgtggct 


tgagtgacaa 


ggtgagagca 


gatctcatta 


actgaaatga 


gagagaagga 


70860 


aggaattttg 


caaatatgga 


aagataacta 


gtgcaagttt 


gaacagatta 


tgtcaatcaa 


70920 


tgtagaattt 


ggctatcttt 


ttaatcaaag 


aagactatgg 


aatattttat 


aggtgtttgc 


70980 


ttatactcaa 


agttttaaag 


aaataacagt 


atgaatttgg 


ttgaactaat 


ttttttcata 


71040 


gataggattc 


tcccaagtta 


tatagcatat 


atatttctta 


actagttatt 


cttcctttta 


71100 


catatattgt 


gccacattga 


gtaacaacta 


acctgctaat 


agctattggt 


ttttaaaaga 


71160 


taattaatat 


tagaaagtga 


tcatttttct 


gtttcatatt 


aaacatgata 


ttctgaaaaa 


71220 


gcaacattgc 


ctgaatgttc 


tacattttat 


ctttttgaaa 


acaggtttta 


taagagattt 


71280 


cttgtgaaaa 


gctgaacgtt 


ctgacactga 


aataagtcag 


ctaactcaaa 


gctaagctta 


71340 


attttttgac 


actgttggca 


tgaggtctca 


ttcccaattt 


tttcatttaa 


agccacaggc 


71400 


aaatgtttta 


acagatttta 


atccgtagta 


caagcattat 


tgatcttaaa 


tttaaggata 


71460 


aaaacctgat 


tttaattaga 


atttaatatg 


cattctagta 


tttacgttgt 


ataattaata 


71520 


tttacattcc 


atgattccac 


tatgtaccat 


ttatttcttt 


ttgaataaat 


ttccagtagg 


71580 


agcagaataa 


attttcagtg 


aatattttat 


ttcttggggg 


atatttttaa 


atggaaaata 


71640 


tattaagttt 


cggtaaaatc 


tgttgctaat 


ttggcagtgg 


acagaatata 


aaaattggag 


71700 


agactgagtc 


attatgatga 


attgggtctg 


acttttgtca 


tgacactgga 


aatttcccac 


71760 


aaatattata 


ttcttctttt 


ataataaata 


tagtcgaaat 


gaattgcagt 


caagtatttg 


71820 


aagacccatc 


tataaattta 


ggcggttact 


gttgattttt 


cat tat gaga 


gattcttcca 


71880 


ctcataagct 


actaaaagta 


cataaagaag 


gtctggttgt 


ttgttttaaa 


tgtgactgtt 


71940 
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ctctatcagg 


aaaatgtcag 


gtatccgatg 


aaaatagata 


tatgaggtgc 


caggtatcta 


72000 


ttccaaactt 


ggatatcact 


tcaattagca 


tcatcttttt 


ttttttttaa 


agtgtctaag 


72060 


gttagaatag 


tcaccagata 


ttcccatgta 


tgaagcaatt 


ttctgcaaag 


gccgctgtgg 


72120 


atgatctttt 


taaaatatat 


attctgggag 


acattgagta 


aagagaaatt 


atttaccaga 


72180 


gaatgaagaa 


ccgaggcccg 


attctttggc 


tttctgccaa 


agatgctgaa 


ggcagtgatg 


72240 


aatgacaaat 


acattaccaa 


ggaattctcc 


ctctaagagg 


ctgacaaaga 


tctgat tttt 


72300 


aggattatat 


taccaccaag 


aagatacccc 


ttgtcactga 


gcttctaatg 


gaaatatggt 


72 3 60 


ctatactgaa 


acaattctca 


gttctttttc 


tttctatctt 


tttttgagtt 


attttatctt 


7242 0 


ccaaaaatga 


gttatttctg 


ataaaataat 


tcacttaaat 


aattatgaaa 


gttcaaattt 


724 80 


gtgcaaatat 


ttttattggg 


acatcttaaa 


attactctaa 


attcaaaaag 


aaaatatatg 


7 2 540 


ctttattaaa 


atttgatctg 


taagctgctt 


tgtttgtaat 


ttaactatta 


tataaaaatt 


72 600 


gtataataca 


tatattttat 


ttactttatt 


cctgtgttgc 


tttggcttgg 


tgagactagg 


72660 


tctccacatt 


aggagtttta 


ctgaatgaaa 


aagtatcaga 


atgtaacatg 


actttgatat 


72720 


ggcatcagaa 


tttaataaga 


tgacatttaa 


taggaattag 


gggtaagttc 


caggttttac 


/ 2 / o U 


acttaaatac 


aaataatcaa 


ttttgcaggc 


acaaaatact 


tcaaacaaaa 


tctgaaatca 


72 84 0 


ttcatttgac 


aaaacttcag 


gtttgcagtt 


gacaataaat 


acaatacaat 


gcaacagtgc 


7 2 900 


aatagtgata 


tctaaatatc 


taatgtaatc 


ataggtaata 


ttagtaagtg 


tgttatctga 


72 96 0 


aatgagtggt 


gtgatatcct 


gctttacttt 


gtactggtga 


gttctgggtg 


ccacctttga 


73 02 0 


aaggaataaa 


gactattcat 


atctctttta 


taagacaata 


agaaaaacaa 


acaaacaaac 


T 1 f\ O f*i. 

7 3 0 8 0 


aaacaaaaaa 


ccacctcctt 


tactttagct 


gagaaagaag 


ttattaggta 


cagcttgaca 


7 314 0 


agttcagcta 


agcatccaaa 


tcttccagga 


ggttgttact 


acataaaat c 


aaaccttttt 


*"7 i *~i r\ r\ 

732 0 0 


aattcaacta 


tgagcaggga 


gattttattt 


ttctttcggg 


tactaaagct 


tccaaactct 


73260 


gtttattcca 


caggaatctg 


aacttataga 


actaagagaa 


accattgaaa 


tgctgaaggc 


7332 0 


tcagaattct 


gctgcccagg 


cggctattca 


gggagcactg 


aatggtccag 


accatcctcc 


73380 


caaaggtata 


tttagaaatc 


atttcatttc 


cacccaatat 


aataggcatc 


tattttattt 


73440 


attaattaca 


gtagaactgc 


atttactcag 


tgtcactgtg 


cattattaat 


acatactagt 


73500 


tgtattaata 


gttgtattaa 


tacatactag 


tagtattaat 


acatactacg 


ttggtattaa 


73560 


tgtgatcaga 


atcctagaat 


tttagaacag 


tgacttccat 


tatcagataa 


tttttaaact 


73620 
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gatcttaaga aatttggttc tatagttgta tacacatctc tctacttgat tcagtggaga 73680 

tggagatgga gtggttggtt aatacatgca tatctgactt caggcaaaac aaacccatta 73740 

atgagtatga taatctagat ctgtatttaa aaatgaaata gtcaatatga tgatatagta 73800 

agcagtgggc attgggaaca acttttcctg gatggaggct ataaaaaggt acatttcctg 73860 

tagataattt tgaaacaata aaaacaacgg gtgaaaggta gctctgtttt aaattattcc 73920 

tatgcttaag caattctaaa caatgaaagg ggtatttctg ccactgcccc tacccctggg 73 980 

ttcaccactg aagaaatgct cattattaat atcgtgtcat ttttttcctt tacattggtt 74040 

ctatttactc atttcctgac acttttcaat ggccttcagt gagctcagct ctttcccagc 74100 

ttaaaaaatc ctgtcctaaa acatgaatgc cttattatct ctcttttcat ttccagaaga 74160 

attctgagaa aaattttatg aagtctttca atgtcttcag ccatctttag accactggag 74220 

tgtagctcct tttccctcca ctccaccaaa acaatgctct ccaggatcag cagaaactta 74280 

catgacacta aattcagtaa aacgtttata attcttattg tattagacag acatggaaac 74340 

agcatttgat gctgatattc atttcttcct atgtgaaaca tccggttttt ctaatgttcg 74400 

tgacatcata cattcttggt ttttcttctg ttcctttgaa atattttttc aatatttctt 74460 

ttgtaaattc actcttttgt atccatttgt taattgttga tatcctaagc tctcttccat 74520 

tatgattcta tgcatcctat ttaaaatata tagaaaatca tctcatactc tagctgtaat 74580 

ttttattaat gtgctaatag ctaataactg tcaaatctag gtctccaggc caggctctgt 74640 

atatccagct accaagagag aactccacgt ggatatcttt ggatgtctgt tttgcatctt 74700 

aaacctaact tctccaaatt tgcacttgtc ttctgtctca gacctgctgc tccttcagtg 74760 

ctctttgcct cagtagatag caccaccatc cttccattta gccagaaatc taagtattct 74820 

tcataactcc tcctctcctc attgaataaa ttaccaagat ccgttgatcc cattccttaa 74880 

atatctcttg gatctgttaa cttttctctg attttactct tgccatccat cacctctctc 74940 

ctgaaccatg accacaaacc cctaaatagc cttcctcttc ttaatcttat cctgctttac 75000 

accagtcttc acgctgaagc cagaatagtc attaagaaac acatctacag gtatcccatt 75060 

cattgccttt agaatggaat acagactcct cagcatgaca taatctctct tcaccagctt 75120 

catttattca acaaatattt attcataacc aattaagtgc cagatgatgc acatatagac 75180 

ttcttgttct gttgttgcat tgcatattcc atatttcagc tatcctgaat tgttttcaat 75240 

tattcataag ttctttatga attgtgttca ttccatttgg aatattctac cttgtttgat 75300 

cagcataaag acttttcgag acactgcagc agcagtgaac ctaaatatgt ttccttgacc 75360 
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cctacattga 
aaattacatt 
gagcttgttt 
aacaggggtt 
ttaaatttta 
ttctgtaggg 
gtgatgcatg 
tatgttctaa 
tactttaaca 
ggcaacaggt 
atgtgggagg 
atatgaaaaa 
aatatatata 
tgaacacaaa 
tggactaatt 
actggaaatc 
ttttaacata 
tgactatcac 
catcacattc 
tacaaacata 
gatttaaaaa 
gataagaaat 
cattgctttc 
caaatatagt 
atagtgatag 
tgtgtactga 
taatggtaaa 
gcagaaggta 



atgacacccc 
atactttggg 
tttcaccctt 
tatttgaatt 
ggaaaacaat 
gctacttctt 
gaaaattact 
atcttcatgt 
cagcacaagt 
agctgaatat 
ggtggagctt 
ataatttgct 
ggatagttag 
gtgattctta 
atgtgtcttt 
acattgaaat 
tgatcttaaa 
acaaccacac 
attaacaaag 
tttaaacacc 
attaccctgt 
ttgaaagttg 
taagtcacta 
tggataacgc 
gcaaacatag 
ttgataccac 
caaaatagat 
aaggaatgta 



ctgtgatatg 
gcttttattt 
ttgaaagaca 
tttaaaagaa 
tatataaagt 
tcaagacttt 
gcatatttaa 
cgtaaatagg 
cactgaagtg 
atctacagaa 
aaggtaaaaa 
aggttttaaa 
attgtatttc 
tcacattgac 
ccatgggtta 
tcacttttag 
agatatattt 
accaatttgt 
tttgataaat 
tgctattaag 
ggtcatggag 
aaggtaatag 
cttctgtgta 
atgtattaaa 
ttatattgaa 
atgccatatg 
acggtctttg 
tgtgatcatt 



tttctggaag 
aaggtatgtc 
tcaagatgcc 
gaataaagta 
gagatatgct 
agcaactttc 
agcttatctt 
tccagaaggg 
aaacttgctg 
attgaaaaat 
ataacagtta 
actaacaaaa 
tgtaatatta 
cattgacatt 
ttaaggtaat 
aggcccttaa 
ggaatgacac 
gaatgtgttt 
gactatagtc 
tataggcatt 
ctcacaatcc 
aaaattttac 
aatatggaat 
attctgtcag 
tgaaaaatta 
cttgtatagc 
tctccataga 
ggtgaagctg 



cagcaatact 
tttcctgatt 
catgatgatg 
atttttaaat 
taaattgaag 
catgtggggg 
agagctataa 
atttaaaaag 
aaaggattcc 
tggaattctt 
atatctaaat 
accatggtta 
aaactcagca 
acatggaaaa 
tgtatggcat 
aatatttctg 
aacagtttta 
ctatttcctc 
tataataaaa 
atcagatctt 
actgcaaaaa 
ctttattttt 
tgtttttcct 
cactaagttg 
gaatcaaatt 
aatacaaggt 
acttttagtc 
aacatgtata 



tcccttctta 
tacaatagta 
tcttgcatgt 
gaatttcaat 
gacaaagtag 
agtgatttat 
taaagcagct 
ccttaatcct 
ttttatgtta 
ttgctcagaa 
tgatcaagaa 
taaaggtttg 
ttaaatttaa 
aatagtcagt 
ataaatttat 
taatatatat 
tagacaggcc 
taaattaatg 
tatttttgtt 
aaaatacaaa 
taatgtttgt 
caaaatgtac 
taagatatac 
ttttttagac 
tattaaacac 
ttggaattta 
tagtgggaga 
cccaaacagt 



75420 
75480 
75540 
75600 
75660 
75720 
75780 
75840 
75900 
75960 
76020 
76080 
76140 
76200 
76260 
76320 
76380 
76440 
76500 
76560 
76620 
76680 
76740 
76800 
76860 
76920 
76980 
77040 
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tataagttcc 


aagatggaca 


ataatgggtg 


ccatagggaa 


gg a 999tacc 


aaggaaccta 


77100 


ctggaggtta 


catagggaag 


attattccaa 


ggtagtaata 


tttaagtgaa 


tatccaagga 


77160 


ataattgtca 


atcactttat 


aagtactgag 


ggaggagtat 


ttcaaaagag 


ctttgaggcg 


77220 


gaaaataaat 


tagttccttt 


atggaactaa 


tgtaaggaaa 


atactaagca 


aacatgtaat 


77280 


aagaagaaca 


cggttgatga 


gttaagaact 


gacaagatta 


ctgaaggatt 


gtaggccata 


77340 


tttagaagtt 


ggatttttta 


tctattctta 


ttaaagtgag 


aagttattga 


aaggtcttaa 


77400 


gtgggggagt 


gatgatgaag 


tttgcctttt 


aaaaaagatt 


tttctagcta 


ttgtttatag 


77460 


aatggtttga 


agatgaataa 


gtccaatagc 


tatacttgct 


gtaaaggtta 


tgttggtagc 


77520 


ttgaactggg 


gcagtggtga 


cacagaggat 


99g a 9 at 99 a 


aaatgacgag 


tgaacaaaca 


77580 


catacctgaa 


aatttaagtt 


taaaaataga 


cctctccatt 


aattcagatt 


gctgatattc 


77640 


attcggttag 


ccattcttta 


ctgaacttta 


tgatgcccca 


tatactgaat 


taaatactta 


77700 


caagcactaa 


aaaagaaatt 


gttagggaac 


agtaaaatgc 


atttccttca 


tttcacaata 


77760 


ttattaatat 


tatggctttg 


ctaatcttta 


ttggtgaatg 


cagtcataat 


tgaaggtaac 


77820 


tgatacttcc 


aaggactact 


tttgacctag 


gattactatc 


tttttaaaaa 


tttagtatta 


77880 


aagaagtcaa 


acacaattta 


ttaattctgg 


atataataaa 


aattctgaaa 


tactttaata 


77940 


ctttgtgctt 


ttctatttgt 


gaaagttaat 


tattaggaac 


gagctagcaa 


atgctacttc 


78000 


tttttcaaaa 


agctaatggc 


caatcacagc 


aaaaatttaa 


agcactaaga 


aatacctaca 


78060 


catattcttc 


tattgcccat 


ttatatgact 


tccataatag 


ttgattaaag 


gataccggat 


78120 


tcctttattg 


ttgaattaaa 


acctcctaca 


tgaaaacctt 


gatttaggtt 


tagaagttgg 


78180 


taatgttttg 


gcatgcaaaa 


ccagttaatg 


ttctcatcat 


tactttttaa 


aacaatgtta 


78240 


agagatgaat 


tctagggatt 


ataaaaaaaa 


aaaagctgta 


tgtgtttctt 


cctataaaat 


78300 


ttttcagcat 


gattgcctca 


gtagaaaaat 


taagggactt 


attgatatat 


atgtatatga 


78360 


aggtgaggat 


acacatatac 


acacacacat 


atatatgtag 


gtaaatacat 


atattacatg 


78420 


tctatcaatc 


catacatact 


catttattat 


acgttttgaa 


agcaaccagt 


tatagttttg 


78480 


ttgccatgga 


tcatttttac 


tattcagtaa 


atcagtcaat 


tgaagaggct 


tgattttatg 


78540 


gtattagttt 


tttggaaact 


gtcagcttta 


tagtaaattt 


tgacatctta 


caacttccac 


78600 


tgagattttt 


ttgcttgact 


aatctgcctt 


gatgccaata 


agtatattaa 


cggaaatgga 


78660 


ctaaaagcaa 


atgtgacttg 


aagcacaatt 


ttgtaaattt 


tcttagtgtc 


teagtaatac 


78720 


ttaatactag 


tgcattttag 


gtaggaaaat 


tttcagtttg 


ttttatttta 


aataactata 


78780 
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aatcttatag 


ttgcttgtat 


aaaagaaaca 


gataccttta 


acatgattaa 


atatcaaatg 


78840 


ctattctctt 


caaaatatct 


taactaaaga 


agcactgcct 


gctcttagaa 


gttaagcaag 


78900 


gccataccat 


atgctgcgta 


catggctttt 


aacacaatgg 


atattagaaa 


cagcctaagg 


78960 


ctgagcctgg 


ctccactatt 


tttcagctat 


gtgaccatgt 


gaaagttaca 


tttagtaatt 


79020 


aaactcattt 


cagtagtttg 


ctttaagaat 


aaaattaggt 


actccggggg 


catatcaagc 


79080 


atattgtaaa 


acctagtttg 


attattattt 


gttattggta 


ttactattac 


tattctataa 


79140 


taagtcatgg 


gcaggcagta 


ggggtacatt 


ggaagaattg 


cactgtctta 


aatatgtcct 


79200 


ctgtttaact 


cacaaactca 


gtctacctag 


gctttctttg 


gaggatctgc 


ctttcattgg 


79260 


ctgtttgact 


ttggccaagt 


tacttaactt 


cttttcactt 


cagtttcctc 


atctgtgaga 


79320 


ttatgtgctt 


acatgacttc 


aggttttgtt 


ttggctctaa 


tatggtatga 


ttctatgaaa 


79380 


tggaaagtta 


atacatttgg 


ctctagtaac 


tgtatttgaa 


gcacaaatat 


taaaaagcac 


79440 


aattaattct 


cattctgagt 


ttccatttac 


tcttttaaat 


taatcattca 


gaataaatca 


79500 


ttttggaaga 


gctgcttgat 


ccaggtattc 


agtagaaatc 


actagcatag 


catttaattt 


79560 


tagacaaaac 


tgagaactca 


ttaaactgcc 


agggctatgg 


acttatatga 


gattctcatt 


79620 


aaatcttaat 


gtagataact 


cagttaatta 


aaacaaatat 


ggttgtactt 


tattaaactt 


79680 


ctaaagtcaa 


aactgcattg 


aaattatctg 


tacaaagcct 


tgttgacctt 


tattagagaa 


79740 


ctgcctctca 


aaagacctaa 


aagacttatt 


tgttcagatc 


gagactcttc 


atgagccaat 


79800 


gtgatactct 


ccctctattg 


ctagatcttc 


gcatcagaag 


acagcattcc 


tctgaaagtg 


79860 


tttctagtat 


caacagtgcc 


acaagccatt 


ccagtattgg 


cagtggtaat 


gatgccgact 


79920 


ccaagaagaa 


gaaaaagaaa 


aactgggtaa 


gttaccatcc 


ttcatctaat 


tcagaagctt 


79980 


attaatgcat 


aatgtgttag 


gcctttttct 


ttggggcttt 


agtgatctgc 


agtagtttac 


80040 


aaagggtccc 


attcaagcta 


ctgagacctc 


aaatgctgca 


ctcatcacca 


aaattggagt 


80100 


ggcatgtact 


gaaaagcata 


cattttaatg 


ttgggactaa 


acttgggttt 


gaatcaccac 


80160 


tatatctaga 


ccttttgagg 


ggcctgaatt 


ttctaaccaa 


taaaaagaca 


gttaatagca 


80220 


actatattta 


tttgtgaata 


tcatttattc 


acagatgtta 


tctaattttt 


ctatagtata 


80280 


actatacaaa 


ctatgtagta 


taactataga 


gttatactaa 


agaaaaataa 


gataacatct 


80340 


gtgaataaat 


ggcttaaaat 


aggggtttat 


tgtgggcata 


gagatgaagg 


aaaagtgaaa 


80400 


aaatgatgat 


gatggtgatg 


atgatggtga 


tagtggtctt 


ggaggaaaag 


gagaatggga 


80460 
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gttaataaag 
ttaccgtgtg 
tttaaccaag 
tttgctagtg 
gttactgcga 
atgtagattt 
tactggagtg 
acaggtaata 
tttcatgttt 
ttggcatgga 
gcttttatac 
aatctcaaat 
ttctggtttt 
tgttaggtga 
tgattgtagc 
atggagatc 



ggaaagaata 
aatagtttca 
agaatatagc 
atataattta 
aaaatttaaa 
tgagtaaata 
ctttctacaa 
ttttaaaata 
ctgcaaggtg 
tntataaaaa 
cttttaaaac 
tttaaagaga 
gagtagtctg 
atttgatttc 
gctaaagtgg 



aacaatgaaa 
ccctaaaaga 
agaaatatag 
tgatattgat 
aacaatggag 
caactcttga 
ataatgaaag 
ctgatcagcc 
aactctagag 
aaccctacct 
agattacctt 
tgtaaaacca 
gggttggaaa 
attttgcttt 
ccatttaaaa 



ctctcattcc 
aatcaaccac 
aaatgcactt 
caatagctaa 
tcaatgattt 
tgaaatccag 
aaatatcttt 
ttcattccct 
gaagtgaggt 
tggcatgaat 
gtatgtcttt 
ctttctgaat 
atttgaatag 
ttaagtttgt 
caaattgcct 



acctttggaa 
attagtgtct 
taacagaact 
atagcccagg 
cttttaafcac 
acataattat 
ttatcttaaa 
tgatttgtaa 
gaanataaac 
gctatccatt 
tctttgtgtc 
agagctgtag 
aaaaatcaca 
actgtcagca 
tgaagagaga 



tcgacagggc 
gcttgatgtt 
gtaccttaag 
ggaagatact 
caaaaaaaaa 
cagaggattt 
aaatgtttat 
ttccacactc 
cgtggacaat 
ttggcagtag 
ttttcatttt 
gggataccaa 
attaatgaag 
ggacatgact 
agcattggga 



80520 
80580 
80640 
80700 
80760 
80820 
80880 
80940 
81000 
81060 
81120 
81180 
81240 
81300 
81360 
81369 
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